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PREFACE 


Two principles have guided me during the writing of this book: (1) It 
should be written for mathematics majors at the Sophomore-Junior level; 
(2) it should still be adaptable enough for courses at different levels. It con- 
tains more expository material than the usual text; and the proofs of the 
theorems are presented in a gradual style, rather than being condensed to 
a set of terse statements. 

In order to make the book suitable for various courses, I have tried to 
include most of the number theory topics that are of interest to either under- 
graduate mathematics majors or to high school teachers of mathematics. 
Because some instructors require a course in modern algebra as a pre- 
requisite to the number theory course, several topics (including all of 
Chapter 7) are included that can be best appreciated by students with that 
background. Without exception these topics may be omitted with no loss 
of continuity. 

In Chapter 7 algebraic concepts are used in the treatment of arithmetic 
functions. This approach has several advantages over the usual method of 
presentation, not the least being that all mystery is removed from the role 
played by the Mobius function in the inversion of certain sums. Although 
no technical results from modern algebra are used in the presentation, it 
probably would be best to omit this chapter unless the students have success- 
fully completed a course in that subject. 

Of necessity Chapter 8 (on the distribution of prime numbers) is mainly 
expository. A few of the very elementary results are proved while, in Section 
8.4, a number of the more advanced results are stated without proof. 

Most of the sections contain sets of exercises or problems. Many of the 
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exercises are of a computational nature and are designed to illuminate the 
developed theory. Most of the problems, on the other hand, require some 
degree of originality on the part of the student and many of them are suitable 
for class discussion. With a few exceptions the text is independent of the 
exercises and problems. 

A star in the title of a section indicates that it is optional. Several of these 
sections are expository, containing historical information or statements of 
additional results, and should be assigned to be read. Others contain either 
special techniques or technical results. 

For a three-hour course at the Sophomore-Junior level my personal choice 
would be Chapters 1, 2, 3, 4, 6, 8, 9, 10, 11 (not necessarily in that order). 
For a class composed of prospective high school teachers, I prefer Chapters 
1, 2, 3, 4, 5, 6, 8, 11, including the appendices to Chapters 1, 3, and 4. 


James E. Shockley 


BLACKSBURG, VIRGINIA 
JANUARY 1967 
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1.1/ INTRODUCTION 


In the elementary branches of the theory of numbers we are concerned 
with the study of the integers—the numbers {---, —2, —1, 0, 1, 2, ---}—and 
with the problem of obtaining integral solutions to equations. 

This study is ancient, having, to some extent, arisen out of the pseudo- 
science of numerology. For example, the followers of Pythogaras (sixth 
century B.C.) classified the positive integers according to both arithmetical 
and magical properties and discovered a number of important relationships. 
By the time of Euclid (third century B.c.) a fairly extensive body of knowledge 
had been compiled, much of which was later lost to the western world. The 
Hindus in the fifth to eighth centuries A.D. made important advances in the 
theory, in some ways showing greater sophistication than their predecessors. 

The modern revival of the study began in the seventeenth century and is 
due primarily to the great French mathematician Pierre Fermat (1601-1665). 
Fermat, a lawyer who considered mathematics his avocation, had great 
insight into the properties of numbers. Although he did not publish his 
results, his correspondence with other mathematicians stimulated much re- 
search in this realm. His work was continued by two of the greatest mathema- 
ticians of all times, Leonhard Euler (1707-1783) and Carl F. Gauss (1777- 
1855), and by many other famous mathematicians. To these men we owe 
most of the theory that will be developed in this book. 

There are a few prerequisites to the study. In addition to the arithmetic 
properties of the number system, the reader should be familiar with the order 
properties of the integers (the “greater-than”? and “‘less-than” relations) 
and with the absolute value function. In a few places in the text, a knowledge 
of elementary analytic geometry is presupposed; and in Chapters 5 and 12, 
the basic limit theorems from calculus are used. 
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1.2 / THE PRINCIPLES OF WELL-ORDERING AND 
MATHEMATICAL INDUCTION 


The most important subset of the integers is the set of positive integers 
(natural numbers, counting numbers): the numbers {1, 2, 3, ---}. In this section 
we discuss two important properties of these numbers: The well-ordering 
principle and the principle of mathematical induction. Either of these can be 
proved as a theorem if the other is assumed. (They are said to be equivalent 
because of this.) The well-ordering principle is the simplest of the two and, 
in this book, will be used more often than the principle of mathematical 
induction. 


ms WELL-ORDERING PRINCIPLE / Any nonvoid set of positive integers 
has a smallest member. 


Using the well-ordering principle it is easy to prove a similar well-ordering 
principle for nonnegative integers. Hereafter, we will use either of these. 


Example |. 
Prove a well-ordering principle for nonnegative integers. 


SOLUTION: Let S be a nonvoid set of nonnegative integers. We must prove that 
S has a smallest member. Two cases arise. If the number zero is in S, then it is 
less than any positive member of S and is thus the smallest member. If zero is not 
in S, then S consists entirely of positive integers and, from the well-ordering prin- 
ciple, it follows that S has a smallest member. In either case, we have shown that 
S has a smallest member. 


m PRINCIPLE OF MATHEMATICAL INDUCTION / Jf P is a set of 
positive integers that has the following two properties, then P is the set of 
all positive integers: 

Property I. 1 is in P. 

Property II. If the integer k is in P, the integer (k + 1) must also be in P. 


Many readers will have seen the following modified form of the princfple 
of mathematical induction: 

A proposition about positive integers that has the following two properties is 
true for all positive integers: 

Property I’. The proposition is true for the integer 1. 
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Property II’. If the proposition is true for the integer k, it must also be true 
for the integer (k + 1). 

The fact that the two forms are equivalent should be obvious after reflection 
on the part of the reader. 


Example 2. 
Prove that 1 is the smallest positive integer. 


SOLUTION I: (Based on Principle of Mathematical Induction.) Let S be the set of 
all positive integers greater than or equal to 1. It is trivial that 1 is in S. If the 
integer k is in S, kK >1. Therefore, 


k+1>k>1 


and so the integer k+ 1 is in S. It follows from the principle of mathematical 
induction that S is the set of all positive integers. Therefore, each positive integer 
is greater than or equal to 1. 


SOLUTION II: (Based on Well-Ordering Principle.) It follows from the well- 
ordering principle that there is a smallest positive integer, which we call s. Assume 
that s< 1. We multiply the inequality 


0<s<1 
by s, obtaining 
0<s*<s 


which implies that s is not the smallest positive integer. Because our assumption 
led to a contradiction, it is false. Therefore, 1 is the smallest positive integer. 


Example 3. 


Prove the formula for the sum of the first m terms in an arithmetic progression: 


n(n — 1) 


5 d 


at(a+d)+(a+2d)+ + +fa+(n—1d] = na+ 


SOLUTION: The formula is obviously correct for the special case n = 1, because 
the left- and right-hand sides of the equation are equal to a. Assume the formula 
holds for the positive integer k; that is, assume 

k(k — 1 
at+(a+d)+(a+2da+°:4+fat+k-—- Dal = ka + (1.1) 
By adding a+ kd to both sides of (1.1), we obtain 
a+(a+d)+ + + fa+ (k —1)d]+ (a+ kd) 


k(k —1 k+1)k 
om d+ (at kd) =(k+ tat St, 


=ka+ d 
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Thus, if the formula holds for the integer k, it must also hold for the integer k + 1. 
It follows from the principle of mathematical induction that the formula holds for 
each positive integer n. 


Example 4. - 


Let a and x be positive integers. Prove that a positive integer m exists such that 
am >n. 


SOLUTION: The proposition is obviously true when n = 1, for 2a > 1. 
Suppose the proposition is true for the case n = k; that is, an integer 7, exists 
such that am, > k. Then 


am, +1) =am+a>k+a>k+1 


so that the proposition is also true for the case n=k-+ 1. It follows from the 
principle of mathematical induction that the proposition is true for each positive 
integer 7. 


EXERCISES 


1. Prove the formula for the sum of the first n terms in a geometric 
progression: 


atart+ar?+-+ar~*= 


— (r #1) 


2. Prove that if n is a positive integer, then 
4274+ -- ¢n°=(14+24+-- +7) 


The following problem was devised by E. Lucas, a prominent nineteenth- 
century French mathematician: 


3. In a temple in India stand three upright posts. At the beginning of time 
(for our purposes we shall assumed this was 10,000 B.c.), 64 concentric disks 
were placed on one of the posts; the largest disk at the bottom, the next 
largest resting on it, and so on. Priests have been engaged in shifting the 
disks to other posts, subject to the following laws: 
1. Only one disk may be moved at a time. 
2. A disk may never be placed over a smaller disk. 
When all 64 disks have been moved to one of the other posts the world will 
come to an end. 

Assume that the priests have been working in shifts, moving 1 disk per 
second for 24 hours a day, and have never made a mistake. How much 
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longer will the world exist? [Hint: Use mathematical induction to obtain a 
formula for the minimum number of moves needed to shift the first m disks 
to one of the other posts. ] 


PROBLEMS 


1. Use the well-ordering principle to prove that a nonvoid bounded set of 
integers must contain a largest element. [Hint: Suppose that each element in 
the set is less than n. Consider the set of all numbers of form n — s, where s 
is in the original set. ] 


2. Let n be a positive integer and let k be an integer in the range O<k <n. 


Define the binomial coefficient (7) by 


() "ew 


Prove that 7 is always an integer. [Hint: Prove first that if 0 < k < n, then 


+ 
(i) + (ea) = (es) 


3. The Binomial Theorem. Let « and B be numbers. Let n be a positive integer. 
Prove by mathematical induction that 


(a + p)" = (5)e" 4 (Jar Ben asd (7) 98 er (") 


1.3* / EQUIVALENCE OF THE PRINCIPLES OF MATHEMATICAL 
INDUCTION AND WELL-ORDERING 


In this section, we will establish that either the principle of mathematical 
induction or the well-ordering principle can be proved as a theorem, given 
the other principle and the arithmetic properties of the integers. 

The following result is needed in the proof of the theorem. The proof is 
left as an exercise. 


= LEMMA / If k is an integer, there is no integer between k and k+ 1 
(exclusive). 


= THEOREM 1/ The principles of mathematical induction and _ well- 
ordering are equivalent. 
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PROOF: (Part I) Assume that the well-ordering principle holds for the 
positive integers. Let S be a set of positive integers with the properties: 


(1) lisin S. 
(2) If k isin S, k + 1 1s also in S. 


We must prove that S is the set of all positive integers. 

Let T be the set of all positive integers not in S. If JT is nonvoid, it follows 
from the well-ordering principle that T has a smallest member, ¢. Since 1 is 
in S and 1 is the smallest positive integer (established in Solution II of 
Example 2, Section 1.2, using the well-ordering principle), then ¢ > 1. There- 
fore, t — 1 is a positive integer that must be in S since t — 1 < t. The second 
property of S insures that ¢, equal to (t— 1) +1, is also in S—a contra- 
diction. Because our assumption that T is nonvoid leads to a contradiction, 
it is false. Therefore, T is void and S is the set of all positive integers. 

(Part If) Assume that the principle of mathematical induction holds for 
the positive integers. Assume also that there exists a nonvoid set of positive 
integers, S, that does not have a smallest member. Because | is the smallest 
positive integer (established in Solution I of Example 2, Section 1.2, using 
the principle of mathematical induction), then 1 is not in S and is smaller 
than all of the elements of S. 

Let T be the set of all positive integers that are less than all of the elements 
of S. We have established that 1 is in 7. Suppose the integer k is in 7. If 
k + 1 is in S, then, because there is no integer between k and k + 1 (by the 
lemma), k + 1 is the smallest element of S—a contradiction to our assump- 
tion about S. Therefore, if k is in 7, k + 1 must also be in T. It follows from 
the principle of mathematical induction that 7 contains all the positive 
integers and, consequently, S is void. This is a contradiction to our original 
assumption that S is nonvoid. Therefore, if S is a nonvoid set of positive 
integers, then S has a smallest element. 


There is a variation on the principle of mathematical induction that, in 
some cases, is easier to apply: the second principle of mathematical induction. 


m= SECOND PRINCIPLE OF MATHEMATICAL INDUCTION / /f S is a 
set of positive integers that has the following two properties, then S is the set 
of all positive integers: 

Property I. 1 is in S. 

Property IT. If all the positive integers less than k are in S, k must also be 
in S. 


The proof that the second principle of mathematical induction is equivalent 
to the principle of mathematical induction and to the well-ordering principle 
is left for the reader (Problem 1). 


1.4 | DIVISIBILITY OF INTEGERS « 7 


EXERCISE 


1. Prove the lemma. [Hint: Assume there is an integer n such that 
k<n<k +1, and consider the integer n — k.] 


PROBLEM 


1. Prove that the second principle of mathematical induction is equivalent 
to both the principle of mathematical induction and the well-ordering 
principle. 


1.4 / DIVISIBILITY OF INTEGERS 


m DEFINITION / Let a and b be integers. If an integer m exists such that 
b = ma, we Say that a divides b and that b is a multiple of a. We denote this 
fact by writing “a|b.”’ 


For example, 2|4, 7|21, 0]0, 3/0. 

The symbol a|b should not be confused with the rational number a/b. 

Some of the basic properties of the divisibility relationship are established 
in the next theorem. 


m= THEOREM 2/ (a) If alb and dlc, then alc, 

(b) If ajb and alc and if m and n are integers, then 
a\(mb + nc), 

(c) If ajb and b £0, then 0 < |a| < |d|, 

(d) If a|b and bla, then a = +5. 


PROOF: (a) If a|b and blc, there exist integers m and n such that b = ma, 
c = nb. Substituting, we obtain 
c =nma 
Since nm is an integer, this means that alc. 
(b) If alb and alc, there exist integers x and y such that b = ax, c =ay. 
Thus, 
mb + nc = max + nay = a(mx + ny) 
Since mx + ny is an integer, a|(mb + nc). 
(c) If a|b, there exists an integer m such that b = ma. Since b 40, neither 
a nor m is zero, so that 1 < |a| and 1 < |m|. Therefore, 


|b] = |am| = |a|- |m| = lal - 1 > 0 
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(d) Suppose that alb and bla. If b=0, then obviously a=0 and so 
a= +b. If b £0, it follows from (c) that |a| = |b|, which is only possible 
ifa= +b. 


THE DIVISION ALGORITHM 


The next theorem asserts that “‘long division is possible.”’ This is a funda- 
mental result and will be used many times in the remainder of this book. 


= THEOREM 3/ (The Division Algorithm.) Let a and b be integers, b 4 0. 
There exist unique integers g and r, 0 < r < |b| such that 
a=bq+r 


(The numbers q and r are called the quotient and remainder when a is 
divided by b.) 


PROOF: Let S be the set of all integers of form a+ kb where k ranges 
over the integers, that is, 
S ={---,a—2b,a—b,a,a+b,a+ 2b, ---} 


Observe that the integer 
a+ ({a| + 1)|d| 


is nonnegative and is in S. It follows from the well-ordering principle for 
nonnegative integers that S contains a smallest nonnegative integer r. Then 
r—ais a multiple of b; therefore, we may write 


a=bq+r 


If r > |b|, the integer r — |b| is a nonnegative element of S, smaller than r, 
which is impossible. Therefore, 


O<r< |b 
To show that g and r are unique, suppose we can also write 
a=bq,+r, O0<r, < |d| 
where g, and r, are integers. Then 
bq, -—g)=r-ry 
so that b\(r — r;) 


It follows from Theorem 2(c) that if r#r,, then |r—r,| = |b|. But this is 
impossible, because 


1.5 | THE GREATEST COMMON DIVISOR uu 9 
“SS 
Therefore, r=r,. Since b(q, —q) =r—r, =O and b £0, then g, —q =0. 
Therefore, g = q,. 


The division algorithm is frequently used in classifying the integers. For 
example, if a =2q+ 1, we say that a is odd, and if a=2q+0 that a is 
even. Similar classifications may be made with respect to division by 3, 4, 
and so on. 


EXERCISES 


1. Discuss the expression “‘a|b’’ when a and/or 5b is zero. 


2. Write a in the form a = bg+7r,0<r< |b}: 
(a)a=—17,b=5; 
(b) a= —39, b= —8. 


3. Prove that the integer q+ (ja| + 1)|b], mentioned in the proof of 
Theorem 2, is in the set S and is nonnegative. 


4. Prove that: 
(a) The sum of two even integers is even. 
(b) The sum of two odd integers is even. 
(c) The product of two even integers is even. 
(d) The product of two odd integers is odd. 


5. Show that an odd integer is of form 4n + 1 or 4n + 3. 


6. If a is an odd integer, prove that a* leaves a remainder of 1 when 
divided by 8. 


1.5 / THE GREATEST COMMON DIVISOR 


It is obvious that if a and b are not both zero, there is a largest positive 
integer that divides both a and b. (See also Problem 1.) 


= DEFINITION / Let aand bd be integers, not both equal to zero. The largest 
positive integer that divides both a and 5b is called the greatest common 
divisor (g. c. d.) of a and b. This number is denoted by the symbol (a, b). 


For example, (3, 9) = 3, (—7, 21) = 7, (0, —5) =5. 
In a similar way we define the greatest common divisor of a finite set of 
integers a,, a, ---, a, (not all zero) to be the largest positive integer that 
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divides all of the numbers. This number is denoted by the symbol (a,, a,, 
“+, Ay). 


= DEFINITION / Two integers are said to be relatively prime if their greatest 
common divisor is 1. 


A number of the properties of the greatest common divisor can be estab- 
lished directly from the definition. Our work will be expedited, however, if 
we first prove a fundamental characterization of the greatest common divisor. 


m THEOREM 4/ The greatest common divisor of a,, a, ---, a, is the smallest 
positive integer that can be written as a sum of multiples of a,, a, ---, a,. 


PROOF: Because at least one of the given integers, say a;, must be nonzero 
(in order to have the greatest common divisor defined), then obviously at 
least one positive integer can be written as a sum of multiples of a,, a, ---, a,;3 
namely, 


la;|| =O-a, ++» +(4)a;+---+0-a, 


It follows from the well-ordering principle that a smallest such positive 
integer exists, which we will call s. We will prove that 


5 = (4, a2, +++, a) 
Since s is a sum of multiples of a,, a2, ---, a, we can write 
S=M,a, + m,a, + ---mM,4a, (1.2) 
If we divide a, by s, obtaining 
a,=sqtr O<r<s 
and substitute into Equation (1.2), we obtain 
r =a, — sq =a,(1 — mq) + a,(—m2q) + +++ + a,(—m,g) 


so that the nonnegative integer r is also a sum of multiples of a,, a,, ---, d,,. 
Because r < 5s, this implies that r =0. Therefore, s|a,. In a similar way, it 
follows that s is a common divisor of a,, a2, ---, a,- 

Suppose now that d = (a,, a, ---, a,). Since s is a common divisor, then s 
is not greater than d, the greatest common divisor. On the other hand, 
because d divides each a,, it follows from Equation (1.2) that d|s, so that 
[by Theorem 2(c)] d < s. Therefore, 


s=d= (a;, Q2, °°", An) 


The following corollary states a basic property of the greatest common 
divisor. 
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m COROLLARY 4.1 / If d is a common divisor of 4,, a), ---,a,, then d 
divides (a,, a3, ---, d,). 


PROOF: If d divides a,, a2, -:-, a,, then d divides any sum of multiples of 
Q,, 4), °**, a,; 1m particular, d divides the greatest common divisor of these 
numbers. 2 


We will next show that the greatest common divisor of a set of integers 
can be found by a series of steps, each involving the greatest common divisor 
of two integers. 


m COROLLARY 4.2 / If a,, a,, ---, a, are nonzero integers, then 


(a,, Q2, °°", An) oe ((a,, Q2, °°", Ay - i)» A). 


PROOF: Let d = (a, ay, ---, a,). Then d divides each a; (i = 1, 2, ---,n — 1); 
therefore by Corollary 4.1, d|(a,, ---, a4, — ,). Since d also divides a,, we may 
apply Corollary 4.1 again; we find that 


d\((a;, 79°, A, — i) an) 


On the other hand, the integer ((a,, ---, a, — 1), @,) 18 a common divisor of 
(a;,°°',@,— ,) and a,, implying that it divides a,, a,, ---, a,. Therefore, by 
Corollary 4.1, 

((a,, "ety Ay — 1) a,)\|d 


Because ((a,, --:, 2, — 1); @,) and d are positive, it follows from Theorem 2(d) 
that 
d = (a, Az, °*'s a,,) aa ((a,, ty A, — i) an) 


In Section 1.7 we will devise a method to calculate the greatest common 


divisor of two integers. Assuming that this can be done, we work the following 
example: 


Example. 
Calculate (1029, 1911, 9177). 


SOLUTION: Because (1029, 1911) = 91, 
(1029, 1911, 9177) = (1029, 1911), 9177) = (91,9177) =7 


m COROLLARY 4.3 / If d=(a, b), the integers a/d and b/d are relatively 
prime. 
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PROOF: It is obvious that a/d and b/d are integers. From Theorem 4 it 
follows that 
d=ma-+tnb 


where m and n are integers. Therefore, 
1 = ma/d + nb/d 


so that 1 is obviously the smallest positive sum of multiples of a/d and b/d. 
Therefore, by Theorem 4, a/d and b/d are relatively prime. 


m COROLLARY 4.4 / Let a and 5b be relatively prime integers. If albc, 
then alc. 


PROOF: Since a and b are relatively prime, we may write 
1 =ma+nb 


Because albc, there is an integer k such that bc = ak. If we multiply the 
above equation by c, we obtain 


c = mac + nbc = mac + nak = a(mc + nk) 
so that alc. 


This result can be extended to more than two factors. The proof is left 
for the reader. 


m COROLLARY 4.5 /If alb,b,---b,, and a is relatively prime to 
6, (Vj = 1, 2, +++, n — 1), then ald, = 


In Section 1.7, we will need the following corollary: 
m COROLLARY 4.6 / If a=bq+rand b £0, then (a, b) = (0, r). 

PROOF: Obviously, any sum of multiples of a and b can also be written 
as a sum of multiples of b and r, and conversely. Thus, it follows from 


Theorem 4 that the greatest common divisor of a and b is equal to the greatest 
common divisor of b and r. 


EXERCISES 


1. Use Corollary 4.2 to calculate (381, —216, 48, 918). 


2. Discuss the symbol “‘(a, b)”’ if a and/or b is zero. 
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3. Prove that two consecutive integers are relatively prime. 


4. Prove that if (a,b)=1, then (a—b, a+b)=1 or 2. [Hint: Use 
Theorem 2(5).] 


PROBLEMS 


1. Use Problem 1 of Section 1.2 and Theorem 2(c) to prove that if 
Q,, Az, ***, a, are not all zero, there is a largest positive integer that divides 
these integers. 


2. Prove: If a, b, and m are nonzero integers, m > 0, then 
(ma, mb) = m(a, b) 
3. Use Theorem 4 to prove: If (a, b) = (a, c) = 1, then 
(a, bc) = 1 


4. Prove Corollary 4.2 without using Theorem 4 or Corollary 4.1. 


1.6 / THE LEAST COMMON MULTIPLE 


It follows from the well-ordering principle that if a and b are nonzero 
integers, there is a smallest positive integer that is a multiple of both a and b. 
This integer is called the least common multiple (1. c. m.) of a and 5b and is 
denoted by the symbol [a, b]. The least common multiple of a finite set of 
nonzero integers is defined similarly. 

For example, —60 is a common multiple of —6 and —15, but [—6, —15] 
= 30. 

There is a close relationship between the greatest common divisor and the 
least common multiple of two nonzero integers. Because of this, it is fre- 
quently possible to establish properties of one of the quantities if similar 
properties are known to hold for the other. This relationship is given in 
Theorem 6. Before proving it we state a needed result. The proof is left for 
the reader (Problem 1). 


=» THEOREM 5/ If misacommon multiple of the nonzero integers a and b, 
then [a, b]|m. 


=» THEOREM 6/ If a and 6 are nonzero integers, then 
[a, b\(a, b) = |ab| 
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PROOF: Let d=(a,b) and m=l[a, b]. The rational number |ab|/d is 
obviously an integer. Since 


Ma (28)-0-(29 


then |ab|/d is a positive common multiple of a and b, and is no smaller than 
the least common multiple. Therefore, 


eed =m (1.3) 


On the other hand, because |ab| is a common multiple of a and JB, it follows 
from Theorem 5 that |ab|/m is a positive integer. Since m is a multiple of a, 
say m = ka, then 


and, therefore, 


In a similar way we see that (|ab|/m)|a. Thus, |ab|/m is a positive common 
divisor of a and b and is no larger than the greatest common divisor, that is, 


18h 24 
m e 


Combining this result with inequality (1.3), we obtain 


|ab| = md = [a, b\(a, b) 


EXERCISES 


1. Calculate [—10, —15, —20, —25]. [Hint: See Problem 3.] 


2. If a and b are relatively prime positive integers, prove that 
[a, b] =ab 
3. Does the relationship [a,, a,, a3] (@;, 4, 43) = | a,a,a3| hold in general? 


4. Let a, b, m, and n be positive integers, (m, n) = 1. If a/b = m/n, prove that 
a positive integer ¢ exists such that a = tm, b = tn. 


5. Prove that the product of three consecutive integers is divisible by 6. 
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PROBLEMS 
1. Prove Theorem 5. [Hint: Divide m by [a, b] obtaining m = [a, b]q +1, 
0<r<  [a, b]. Show that r is a common multiple of a and b.] 


2. Use Theorem 6 and Problem 2 of 1.5 to prove: If m is a positive integer, 
then [ma, mb] = m[a, b]. 


3. Prove: [a,, Az, °°", a,| = [[a,, Ag, °°", Ay il, a,,]. 


1.7 / THE EUCLIDEAN ALGORITHM 


The Greek mathematician Euclid (300 B.c.) is credited with the following 
method for calculating the greatest common divisor of two integers. Because 
of this, the method is called the Euclidean algorithm. 


If the integers a and b are to have a greatest common divisor, one of them, 
say b, must be nonzero. The first step is to divide a by b, obtaining 
a=bq+r O0<r< |b 
If r = 0 we stop. If r > 0, we divide b by r, obtaining 
b=rq, +r, O<r<r 
If r, = 0 we stop. Otherwise, we continue as before, obtaining 
. r="\G2 +r O<r,<7r, 


We now continue this process until we have a zero remainder, obtaining the 
chain of equations 


a=bq+r O0<r< |b 
b=rq, +r, O<r,<r 


r=1P\q, +12 O0<r,<r; 


rn—2=ln-19n T ln O<r,<Tp-{ 
ln -1 =n + 1 


It is obvious that we must eventually obtain a zero remainder because the 
sequence |b|, r,, rz, --: is a decreasing sequence of positive integers (use of the 
well-ordering principle) and we are assuming that r, is the last nonzero 
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remainder. We now apply Corollary 4.6 n + 2 times, obtaining 
(a, b) = (0, r) =(r, r;) =(r,, rz) = aah (ee rn) = (Fr, 0) =!) 


Thus, the greatest common divisor of a and b is equal to the last nonzero 
remainder obtained by the successive divisions in the Euclidean algorithm. 


Example I. 


(a) Use the Euclidean algorithm to calculate the greatest common divisor of 
232 and 136. 
(b) Calculate the least common multiple of 232 and 136. 


SOLUTION: (a) 232 = 1: 136+ 96 
136=1- 96+ 40 
96=2- 40+ 16 
40=2- 16+ 8 
146=2- 8+ 0 
Therefore, (232, 136) = the last nonzero remainder = 8. 
(b) Using Theorem 6, we find that 


232-136 31,552 
(232, 136] = ———— = — > = 3944 


The basic theorem proved about the greatest common divisor is Theorem 4: 
The greatest common divisor is the smallest sum of multiples. We can use the 
Euclidean algorithm to write the greatest common divisor in this manner. 
The process is as follows: Solving the next to the last equation for r,, we 
can express r, in terms of r, _ , and r, _ ,. Substituting the value of r, _ , from 
the second equation from the bottom, we can write r, as in terms of r, _ 5 
and r,— 3. Continuing this successive substitution we eventually write 
r, = (a, b) as a sum of multiples of a and b. 


Example 2. 
Write (136, 232) as a sum of multiples of 136 and 232. 


SOLUTION: From the next to the last line in the chain of equations in Example 1, 


we obtain 
8 = 40 —2: 16 


Substituting the value of 16, we obtain 


8 = 40 — 296 — 2-40) = 5-40 —2 : 96 
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Continuing the successive substitution, 
8 = 5(136 — 96) —2 -96 = 5 - 136 —7- 96 
= § + 136 — 7(232 — 136) = 12: 136 —7 - 232 


The work involved in the Euclidean algorithm can frequently be shortened 
if we make one modification: Instead of choosing the smallest nonzero re- 
mainder at each step, choose the remainder that is smallest in absolute value. 
If this is done, the greatest common divisor is equal to the absolute value of 
the last nonzero remainder. 


EXERCISES 


1. Use the Euclidean algorithm to calculate (187, 221) and write it as a sum 
sum of multiples of 187 and 221. 


2. (a) Use the Euclidean algorithm and Corollary 4.2 to calculate (714, 2030, 
2205) and write it as a sum of multiples of 714, 2030, and 2205. 
(b) Calculate [714, 2030, 2205]. 


1.8 / THE UNIQUE FACTORIZATION THEOREM 


w DEFINITION / An integer greater than 1, not divisible by any smaller 
positive integer except 1, is called a prime number. A positive integer greater 
than 1 which is not a prime number is called a composite number. 


For example, the first few primes are 2, 3, 5, 7, 11, and 13. 

It is obvious that if p is a prime number and a is an integer, then (a, p) is 
equal to | or p. 

The theorem proved in this section is the unique factorization theorem for 
integers. This theorem is of such importance that it is frequently called the 
fundamental theorem of arithmetic. 


m THEOREM 7/ An integer greater than | is either a prime number or is 
a product of prime numbers. Furthermore, the factorization into primes is 
unique except for the order of the factors. 


PROOF: We will first prove the (intuitively obvious) fact that a composite 
number can be factored as a product of primes. 

Suppose the proposition is false. Then there is a composite number that 
cannot be written as a product of primes. Let n be the smallest such number. 
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Since ” is composite we can write 
n = ab l<a<n l<b<n 


Because a is smaller than n then a is a prime or is a product of primes; a 
similar statement holds for b. But then n = ab 1s a product of primes, which 
is a contradiction. Therefore, each composite number can be factored as a 
product of prime numbers. 

We will now show that the factorization into primes is unique. 

Suppose that a composite number exists for which the factorization into 
primes is not unique. Let 


Mm = Pi P2°°* Pj = W192 °°" Uk 


by the smallest such number, where p,, D2, °°, G;. — 1, G, are prime numbers. It 
follows from the remark preceding the statement of the theorem that if 
P, #4q;, then (p,,q;) =1. Thus, if p, #9; for i=1, 2, ---, k — 1, it follows 
from Corollary 4.5 that p, =q,. In other words, p, must be equal to one of 
the g’s. For purposes of notation we will assumed that p, = q,. Then 


m 
— = P2P3°** Pj = 4293 "°° Ak 
Pr 
If k > 2, then m is not the smallest integer for which the factorization is not 
unique—a contradiction. If k = 2, 


m 
— = P2P3 °° Pj = 2 
Pi 


which implies that j = 2. Thus, 


m 
ves sae. 
Py 
and the factorization of m is unique, which is also a contradiction. Therefore, 
the factorization into primes is unique for any positive integer. 


It follows from Theorem 7 that a composite integer n can be written in 
the form 


nas py Og De Oe 
Nn=Pp; P2 PR 


where the exponents are nonnegative integers, at least one of them is positive, 
and the p; are distinct prime numbers. This factorization is frequently called 
the canonical factorization of n. In order to have a standard notation, we will 
also write a prime number in the above form, with the understanding that 
a, may be | and the other exponents may be zero. 
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PROBLEM 


1. Let E be the set of all even positive integers. An element of E is called 
“ E-prime”’ if it is not a product of two smaller positive elements of E; for 
example, the first five E-prime elements are 2, 6, 10, 14, and 18. 

(a) Prove that an element of EF is either E-prime or is a product of E-prime 
elements. 

(b) By an example, show that the factorization into a product of E-prime 
elements is not unique. 


1.9 / THE GREATEST INTEGER FUNCTION 


In number theory we are frequently interested in the greatest integer that 
is not greater than a given real number «. This number is usually denoted by 
the symbol [a]. For example, [5144] = 5, [—7.3] = —8, [x] = 3. The integer 
[a] is called the integral part of « and the difference « — [a] is usually called 
the fractional part of « (even though it may be irrational). 

The quantity [«] arises naturally in our study for it enables us to explicitly 
express the quotient and remainder after division in terms of the original 
integers. For example, if b is a positive integer and if 


a=bq+r O<r<b 


then q = [5] a & 7 Lé]) 


A similar expression holds if 5 is negative (see Exercise 1). 
Some properties of the greatest integer function are established in the 
next theorem. Other properties are listed in Exercises 4 and 5. 


a THEOREM 8/ 


(a) If mis an integer, then [m + a] =m + [a]. 


(6) [w+ BJ -1<[o¢] + [6] <[o + f]. 
(c) If mis a positive integer, then [a/m] = [[o]/m]. 


PROOF OF (6): Write « = [a] + 0, where 0 < @ < 1 (@ is the fractional part 
of «), and B =[6]+ 9, 0< @ <1. Substituting and using the result of (a), 
we obtain 


[« + 6] =[Loe] + [6] + 6+ o] =[«] + [6] +[6+ 9] 
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Now 0 < 6 + @ < 2, so that 
0<[0+o]<1 
Thus, [ao + Pp] =>[o] + [Pl] =[o+ Bp] -1 


The proofs of (a) and (c) are left for the reader. 


EXERCISES 


1. Let b be a negative integer, and let 


a=bq+t+r O0<r< bd 


Express g and r in terms of a and b. 7 


2. Prove part (a) of Theorem 8. 
3. Prove part (c) of Theorem 8. 
4. Prove that the integer closest to the real number «@ is [x + 44]. 


5. Let m be a positive integer and let « be a real number. Prove: 


fo} + [a+] + [ars] e + [e+ 2] =n] 
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ORIGIN OF THE HINDU-ARABIC NUMBER SYSTEM 


A number of studies have been made of the number systems and the 
methods of counting used in various parts of the world. Basing our remarks 
on these studies and on the known history of the system, we can trace, with 
reasonable accuracy, the development of the Hindu-Arabic notation for 
assigning symbols to stand for the natural numbers. 

Our primitive ancestor probably counted on his fingers. We conjecture 
that when he counted to five fingers he said “‘one hand.’’ When he reached 
ten fingers (or possibly ten fingers and ten toes) he said “‘one man.” (Thus 
the use of the word digit for both a number symbol and a finger.) 

When society developed to the point where more elaborate calculations 
were needed, one of our ancestors invented the abacus—a computing device 
that effectively extended the number of fingers at his disposal for counting 
purposes. 
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A possible prototype of the primitive abacus is illustrated in Figure 1.1. 
Recall that our primitive man did not actually need ten fingers in counting 
if he replaced the number “ten fingers’’ with its equivalent “‘one man.” In 
our prototype of the abacus we have a frame of parallel wires, each strung 
with nine beads. To count the number “‘one”’ a bead is pushed to the top 
of the wire at the extreme right. Beads are pushed to the top of that wire in 
sequential order until all nine are up, representing the number “nine.” To 
count the number “ten”’ the nine beads are pushed down and one bead is 
pushed up on the wire next to the one on the extreme right. To represent 
“one hundred”’ a bead is pushed up on the third wire from the right. Thus, 
the number ‘“‘three hundred twenty-seven”’ is represented on the abacus in 
Figure 1.1. 

There are many variations in the form of the abacus. Our prototype in 
Figure 1.1 is comparatively sophisticated because it has only nine beads on 


Figure 1.1 Prototype of a primitive abacus. 


each wire. A more primitive model would probably have ten beads on each 
wire although only nine are actually needed. When the abacus was used in a 
society where the counting was based on “hands” rather than “men,” it 
probably had four (or five) beads on each wire.’ 

The last stage of development is the invention of symbols to represent the 
numbers. It is natural to base the notation on the abacus. There are two 
standard ways to do this. 

The first method is to assign a symbol to stand for a bead on one of the 
wires of the abacus, a different symbol for a bead on a different wire, and so 
on. To represent a number, we repeat the necessary symbols as many times 
as there are beads pushed up on the corresponding wires. For example, in 
the Roman number system the symbol XXVIII stands for 2 beads on the 


1The names of the numerals in the French language seem to indicate a system 
based on the number twenty. 


22 m BASIC CONCEPTS 


“tens” wire, 1 bead on the ‘“‘fives’’ wire, and 3 on the “‘units’’ wire; that 
is, the number 28.7 

The second method is to assign symbols to represent the number of beads 
pushed up on a wire and let the position of the symbols indicate which wires 
are being described. If 9 beads are on each wire, 9 symbols are needed. Using 
the symbols currently popular, we write the number “‘ three hundred twenty- 
seven”’ as “‘327”’ (3 beads on the “‘hundreds”’ wire, 2 on the “‘tens”’ wire 
and 7 on the “units”’ wire). 

The Babylonians and the Hindus had systems of the second type—the 
Babylonians’ system based on the number sixty and the Hindus’ system 
based on ten. 

The second method has one defect. How does one represent a number if 
one of the wires has no beads pushed up? The great invention of the Hindus 
was a symbol, now written “0’’, to indicate this fact. In all probability the 
symbol was at first only a place holder, but it soon became apparent that by 
using this symbol calculations could be carried out as efficiently without the 
abacus as with it.° 

We do not know exactly when the Hindus invented the system (probably 
after A.D. 600), but about A.D. 800 it was adopted in the Arab world and 
was gradually introduced into Europe. The major force behind its adoption 
was Leonardo Pisano (also called Fibonacci) of Pisa who urged its adoption 
in the early thirteenth century. The monks and scholars continued to use 
Roman numerials, but the new system was quickly adopted by the Italian 
merchants. Despite minor setbacks,* the usage spread, and by the sixteenth 
century the Hindu-Arabic numbers were used throughout Europe. 
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We have observed that the number ten plays a special role in our number 
system. The obvious question follows: Does the number ten have any special 
properties which make it more suitable for the special role than the other 
positive integers? The answer, of course, is no. If our ancestors had been 
born with twelve fingers we would probably have a system based on the 
number twelve or on the number twenty-four (assuming they had twelve 
toes as well). 


?The Roman number system is more complicated than as indicated above 
because the principle of subtraction is also used. 

3We need the ten symbols ‘‘0, 1, 2, 3, 4, 5,.6, 7, 8, 9,”’ the multiplication and 
addition tables (each with 100 entries), the associative, commutative, and 
distributive laws. 

“For example, the use of Hindu-Arabic numbers was forbidden to the 
Florentine merchants in 1299. 
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We will now prove that any integer g greater than 1 can be used as a base 
for a system of enumeration similar to that of the Hindus. Obviously, the 
only requirements such a number must fulfill are: 

(a) There must exist g symbols to represent the numbers, 0, 1, 2, ---,g — 1, 
and 

(b) if m is a natural number, we must be able to write 


m=ag't+a,-1g '+::+ayg+a, (0<a,;<g) 
For example, if g is 7 and m is 68, we can write 
68 =1-774+2-745 


Thus, in the system based on seven (assuming the symbols 0, 1, 2, 3, 4, 5, 6 
represent the integers “zero” through “six’’) we would write sixty-eight 
as 125. 
In order not to confuse this number with the number one hundred twenty- 
five it is customary to use subscripts> to indicate the base; for example, 
68,5 = 125, 


Since the assignment of g symbols causes no theoretical difficulty, we must 
only prove that condition (b) always holds. 


=» THEOREM 9/ Letg be an integer greater than 1. Ifmis anatural number, 
there exist integers dp, a,, -*:, a,, O< a; <g, such that 


m=ag't+a,—-19' +: ayg + ao 


PROOF: We divide m by g, obtaining 
M =og + Ao O0O<a)<g 
Obviously, qo = 0. If go is positive, we divide g, by g obtaining 
Jo=Hng+a O<a,<g 
We now continue in this fashion until one of the quotients is equal to zero, 
obtaining the following chain of equations: 
M = og + ao O<a,<g qo > 9 
do=Mi9 + a O<a,<g qi, >0 


*The subscripts are written in base 10. 
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di: = 429 + a2 O<a,<g qz>0 


Gr-2= 4-19 + -1 O<a,-1,<g G:-1>9 
Qa-1=49+ O<a,<g q, =0 
Since q, is the first zero quotient, then a,, which is equal to q, — ,, is positive. 
We now substitute successively: 
M = dog + ao 
= (419 + 41)9 + 49 = Gig" + ayg + a 
= (42g + a2)9* + a1g + Ay = Gng* + ang’ + 44g + Ao 


= (4-19 +4,- sg ++ +449 + a 
=G:-19° + 4,- 19° ++ +. 44g + ao 
=ag't+a,-i1g'~ 1 ++ +ayg + a 
The proof of Theorem 9 gives us a practical method to change from base 
10 to base g. Note that the “‘units’’ digit, ao, is the remainder when m is 


divided by g. The next digit is the remainder when qo, the quotient just 
obtained, is divided by g, and so on. 


Example. 
Write 2314, in the base 7. 


SOLUTION : 
2314=330°7+4 aud 
330= 47-7+1 a,=1 
47= 6:°7+4+5 a,=5 
6= 0:7+6 az, =6 
Thus, 231410 = 6514. 


It is customary to use the symbols 0, 1, 2, ---, 9 to represent the same numbers 
as in base 10 whenever this is feasible. Thus, in the above example, the seven digits 
are ‘‘0, 1, 2, 3, 4, 5, 6’’. If the base g is larger than 10, new symbols must be invented. 
For example, in the base 12, we need a symbol for ten and a symbol for eleven. 


EXERCISES 


1. Make addition and multiplication tables for the base 2. Convert 237,5 
and 194,, to base 2 and multiply them together. Check your answers by 
converting back to base 10. 
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2. Follow the instructions in Exercise 1 for the base 20. 

[The reader will observe that in Exercise 1, the multiplication and addition 
tables are simple (and easily learned) although the calculations are laborious. 
In Exercise 2, the tables are large (and difficult to memorize), but the calcu- 
lations are easy. In general, the larger the base the simpler the calculations, 
with the drawback that the tables are more difficult to prepare and memorize. 
For practical purposes, base 10 seems to be as convenient as any other. ] 


PROBLEMS 


THE GAME OF NIM 


In the ancient Chinese game of Nim, two players alternate in withdrawing 
stones from 3 piles that have been formed by a third person. At his turn, a 
player must remove at least 1 stone from one of the piles and he may remove 
as many as he wishes from that one pile. The person who removes the last 
stone from the last pile wins the game. 

Analysis of the Game | After each turn write the number of stones in each 
pile in base 2, arranging the three numbers so that corresponding digits lie 
in a column. For example, if the numbers are 17, 21, and 35, we write 


17,9 =010001, 
21,.=010101, 
35,,.=100011, 


We say that a player is in a position of Type A if, after his turn, each column 
contains an even number of one’s. If at least one column contains an odd 
number of one’s, we say the player is in a Type B position. Thus, the player 
who leaves 17, 21, 35, as above, is in a Type B position, while a player who 
leaves 17, 21, 41s in a Type A position. [If we write 17, 21, and 4 in base 2, 
we obtain 

17,5 =10001, 


21,,=10101, 
4.0=00100, 


which has an even number of one’s in each column. | 


1. Show that if a player is in a Type A position the other player must go 
to a Type B position at his next turn. If a player is in a Type B position it is 
possible for the other player to go to a Type A position. 
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Thus, if a careful player is once able to get into a Type A position he can 
remain in such a position throughout the rest of the game and the other 
player will be forced to remain in a Type B position. Since the winning 
position (0, 0, 0) is of Type A, the player who stays in a Type A position is 
assured of winning the game. 


RUSSIAN FARMER’S MULTIPLICATION 


The following multiplication technique was used by the peasants during 
the Middle Ages in Russia and some parts of Europe. 

To multiply two positive integers, successively divide one of them by 2, 
neglecting the remainders, and then write the quotients in a column. We 
successively double the other integer as many times as we halved the first, 
arranging the results in a similar column. Next we remove from the second 
column all entries for which the corresponding numbers in the first column 
are even. Finally, we add together the remaining numbers in the second 
column. The sum is seen to be the product of the two original numbers. As 
an example, we multiply 25 by 17: 


25 17 
12 BA 
6 63 
3 136 
1 272 
425 = 25-17 


2. Prove that the method described above always works. [Hint: Show that 
the final addition is equivalent to multiplying the second number by the first 
when the first number is written in base 2.] 


LINEAR 
DIOPHANTINE 
EQUATIONS 


2.4 / INTRODUCTION 


Let «, B, y be real numbers, not both « and B equal to zero. The equation 
ax + By =y 1s called a linear equation because the corresponding graph is a 
straight line in the X Y-plane. 

The problem of solving this equation becomes complicated if we wish to 
restrict ourselves to integral solutions. For example, the equation x + y = 1 
obviously has an infinity of integral solutions; however, the equation 
6x + 8y =7 has none. (If we substitute integers for x and y, the left-hand 
side is even and is thus not equal to 7.) 


= DEFINITION / A problem to be solved in integers is called a diophantine 
problem, after the Greek mathematician Diophantus (about a.p. 300). An 
equation of form ax + by =c (a, b, c integers, not both a and b equal to 
zero) is called a linear diophantine equation (in two variables) if it is to be 
solved in integers. 


The study of diophantine problems is one of the oldest branches of mathe- 
matics. Undoubtedly, one reason for this is the fact that man has used the 
positive integers much longer than the other number systems. Diophantine 
equations also furnish a natural vehicle for puzzles and problems of a mathe- 
matical nature. 

Although Diophantus receives credit for the first systematic investigation 
of diophantine equations, partial results had been obtained by the Greeks 
of earlier times. Actually, considered by modern standards, Diophantus was 
not very systematic in his study. He was usually content to find only one 
solution of an equation, and he did not allow negative solutions.’ He used 


1Diophantus apparently had no conception of negative numbers apart from 
their role in subtraction. 
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SO many special techniques that a person who had studied his solution of one 
problem might have no idea of how to attack a similar problem. 

~ The Hindus, on the other hand, did make a systematic study of linear 
and certain nonlinear diophantine equations. Aryabhatta (born A.D. 476) 
completely solved the linear diophantine equation in two variables by a 
method called the pulverizier, a variation of the method we shall use to 
prove Theorem 1. 

During the middle ages, the Arabs served as custodians of the Greek and 
Hindu knowledge, eventually introducing much of it into Europe. Although 
several Arab mathematicians considered diophantine equations during this 
period, they made no original contribution to the subject. 

Diophantine problems, formulated as puzzles, were popular in medieval 
Europe. They were usually solved, of course, by trial-and-error methods. In 
the seventeenth century, the French mathematician Bachet (1581-1638), un- 
aware of the work of the Hindus, also solved the linear diophantine equation 
in its most general form. Although this ended the mathematicians’ search for 
solutions to particular problems of this type, the laymen continued to enjoy 
the linear diophantine puzzle. 

Some interesting puzzles come from the medieval period. One of the oldest 
in our possession is the following from the tenth century. It is believed to be 
from a copy of a collection of puzzles prepared for Charlemagne by the 
monk Alcuin. 


One hundred measures of grain are distributed among 100 persons so 
that each man receives three measures, each woman two measures and 
each child half a measure. How many men, women, and children are 
there? 


A number of problems similar to this one have been transmitted through 
the ages virtually intact as part of the folklore. They can frequently be recog- 
nized by one of the following two characteristics: (1) the special role of the 
number 100, and (2) the use of prices not compatible with those of the present 
time. For example, the following problem was popular with the author’s high 
school classmates: 


Pigs cost 3 pennies each, sheep 2 pennies each, and rabbits are 5 for a 
penny. A man is given 100 pennies and told to buy 100 animals. How 
many does he buy of each kind? 


In this chapter we shall prove necessary and sufficient conditions which 
insure that the linear diophantine equation has a solution, derive the general 
(parametric) form of the solution, and study the technique of solution used 
by Euler. 
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2.2 / THE DIOPHANTINE EQUATION ax + by=c 


= THEOREM 1 / The linear diophantine equation ax + by =c has a solu- 
tion if, and only if, the greatest common divisor of a and b divides c. 


PROOF: (Part I) Let d = (a, b) and suppose that d|c. There exists an integer 
k such that c = kd. Since d is a sum of multiples of a and b, we may write 


am + bn=d 
Multiplying this equation by k, we obtain 
a(mk) + b(nk) = dk =c 


so that x = mk, y = nk is a solution. 
(Part II) Suppose that xo, yo is a solution of the equation. Then 


aXyo + byy =. 


Since d divides a and b, then d divides c. 


Observe that the proof of Theorem 1, together with the Euclidean algorithm, 
provides us with a practical method to obtain one solution of the equation. 
We write (a, b) as a sum of multiples of a and b and then multiply by an 
appropriate constant. 

The problem of solving a linear diophantine equation can be considered 
from a geometrical standpoint. A point in the X Y-plane is called a Jattice 
point if its coordinates are integers. Thus, solving the diophantine equation 
ax + by =c is equivalent to finding all lattice points that lie on the straight- 
line graph of the equation ax + by =c in the XY-plane. The slope of this 
line is equal to the rational number 


a_—ald 


b b/d 


where d = (a, b). Therefore, if xo, Yo is a solution, the pair 
Xo + kb/d, Vo — ka/d 


is also a solution for any integer k. Figure 2.1 pictures the situation for the 
diophantine equation 2x + 3y =2. 
We will now establish that these are the only solutions. 
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(X9+L/0, Yo- G/0) 


Figure 2.1. Graph of the linear diophantine equation 2x + 3y = 2. 


Ja THEOREM 2 / Let (a, b) =d. If Xo, yo is a solution of the diophantine 
equation ax + by =c, the general solution of the equation is 


xX =X + kb/d, y = yo — kald 


where the parameter k is an integer. 


(Because d is a common divisor of a and b, the rational numbers b/d and 
a/d are obviously integers.) 


PROOF: We established above that if k is an integer, then x, + kb/d, 
Yo — ka/d is also a solution (see also Exercise 5). We must now show that all 
solutions are of this form. 

Suppose that x,, y, 1s a solution of the diophantine equation. Then 


ax, + by, =cCt=> AX» + byo 
so that ax, — Xo) = —b0); — Yo) (2.1) 


Since (a, b) =d, there exist relatively prime integers a’ and b’ such that 
a =a'd, b = b'd (Corollary 4.3 of Chapter 1). If we substitute in (2.1) and 
cancel d, we obtain 

a(x, — Xo) = —5'; — Yo) (2.2) 


Thus, b’la’(x, — Xo) and (a’, b‘) =1. From Corollary 4.4 of Chapter 1, it 
follows that 
b’|(x; — Xo) 


that 1s, X,—- Xo = kd’ 


for some integer k. If we substitute this value into (2.2) and cancel b’, we 
obtain 


/ 


V1 —Vo = —ka 
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Thus, 
Xy =X +t kb’ =XxX_Q + kb/d, y, =Vo — ka’ = Vo — kald 


Since x,, y,; was an arbitrary solution, this implies that all solutions are of 
the desired form. 


Example |. 


Solve the diophantine equation 


738x + 621y = 45 


SOLUTION: We use the Euclidean algorithm to calculate (738, 621): 
738 = 1-621+4 117 
621 =5-117+ 36 
117=3- 36+ 9 
36=4: 9 
Since (738, 621) =9, which divides 45, the equation has a solution. Working 


backwards through the above chain of equations, we write 9 as a sum of multiples 
of 738 and 621: 


9 = 117 —3- 36= 16: 117 —3 - 621 = 16: 738 — 19: 621 
Multiplying by 5, we obtain 
45 = 80 - 738 — 95 - 621 


so that x» = 80, yo = —95 is a solution. The general solution is 


621 
x = 80+ > k = 80+ 69k 


738 
y= POS ge ie —95 — 82k 


A few of the solutions are listed in Table 2.1. 


Table 2.1 
k x y 
—2 —58 69 
=e —] 11 —13 
0 80 —95 
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Example 2. 


Obtain all positive solutions of 


7x + Sy = 100 


SOLUTION: The greatest common divisor of 7 and 5 is 1. By inspection we write 
3:7—4-5=1 
Thus, one solution of the original equation is 


Xo = 300, yYo= —400 
The general solution is 
x = 300+ 5k 


y =400 — 7k 
Because we wish to find the positive solutions, we solve the inequalities 
300 + 5k >0 
—400 —7k >0 


and obtain —60 <k < —57'4. Therefore, the diophantine equation has exactly 
two positive solutions corresponding to the values k = —59, k = —58. See Table 2.2. 


Table 2.2 


EXERCISES 


1. Solve the two diophantine problems stated in the text of Section 2.1. 


2. Solve the diophantine equations 
(a) 57x — 87y = 342; 
(b) 1411x + 1547y = 224. 


3. Obtain all positive solutions of the diophantine equations in Exercise 2. 
4. Find a solution of 105x — 273y — 195z = 1365. 


5. Show by direct substitution that if x9, yo is a solution of ax + by=c 
and d = (a, b), then x, + kb/d, yo — ka/d is also a solution for any integer k. 


2.3 | THE METHOD OF EULER «a 33 


PROBLEMS 


1. Prove that if a and b are relatively prime positive integers, the dio- 
phantine equation 


ax — by =c 
has an infinity of positive solutions. 


2. Establish necessary and sufficient conditions which insure that the linear 
diophantine equation 


AX, + AgX_ +++ +4,x, =5 


has a solution. 


2.3* / THE METHOD OF EULER 


From the theoretical standpoint, we have completely solved the problem 
of obtaining solutions to linear diophantine equations. We have a simple 
test that can be used to determine whether solutions exist. If so, we have 
a method to obtain one solution that can then be used to obtain all solutions. 

From the practical standpoint, however, much is still to be desired. The 
use of the Euclidean algorithm is cumbersome, especially if the equation is in 
more than two variables. Furthermore, the solutions obtained are apt to be 
very large in comparison with some of the other solutions. An alternate 
method of solution that does not have these defects would be convenient. 

Such a method is ascribed to the great Swiss mathematician L. Euler 
(1707-1783). It involves a simple process, repeated several times. The process 
is easy to understand, requiring little more than the division algorithm and 
the closure properties of the integers under addition and subtraction.” 

Although lengthy, it is no longer than the process involving the Euclidean 
algorithm and is frequently much shorter (especially if the equation is in 
more than two variables). 

Euler’s method is best introduced by an example. 


Example. 
Solve the diophantine equation 


738x + 621y = 45 


2The fact that the sum and difference of two integers are also integers. 
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SOLUTION: Assume that x, y is a solution of the equation. The first step is to solve 
for the unknown that has the smallest coefficient in absolute value; in this case, y. 
Using the fact that 738 = 621 + 117 and 45 = 0- 621 + 45, we obtain 


—738x + 45 —117x+ 45 


621 621 


If x and y are integers, then (—117x + 45)/621 is also an integer, which we will 
call t. We now rewrite the expression for ¢ as the dipohantine equation 


621¢ = —117x + 45 


which has smaller coefficients than the original equation. We now repeat the 
process, solving for x, which has the smallest coefficient in absolute value: 


—621t + 45 —36t +45 
= = —5t4+ ——-_— 


oe 117 
=—St+u where je 
117 
This gives us the equation 
36¢ + 117u = 45 
Solving for t we obtain 
oe —sut14+—a*? = —se+ i+ 

where 36v = —9u-+ 9 and v is an integer. 


Solving for u we now obtain 
u= —4v+ 1 
To obtain one solution, we pick a convenient value of v, say v =0, and 
work back through the chain of equations: 
v =0 
u=—4v+1=1 
t=—3u+1l+v0=—-2 
x=-3t+u=11 
y=-x+t=-13 


Thus, x = 11, y = —13 1s a solution. 
To obtain the general solution, we let v be the parameter and substitute 
into the chain of equations: 


u= —4v+ 1 
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t=—3u+1+v0=13v-2 
x=—-St+u= —69v+ 11 
y= —xX4+t = 820 — 13 


Thus, the general solution is x = —13 + 82v, y = 11 — 69v. 
This solution should be compared with the solution obtained in Example 1 
of Section 2.2. 


EXERCISES 
1. Use Euler’s method to solve 
Slw + 102x + 34y + 170z = 1377 
2. Where does Euler’s method break down if we attempt to use it to solve 
738x + 621ly = 51? 


3. An apartment building contains apartments at two rental rates: Those 
of Type A rent for $87 per month and those of Type B for $123 per month. 
When all apartments are rented the gross income is $8733 per month. How 
many apartments are there of each type? 


4. Into the bright and refreshing outskirts of a forest, full of numerous 
trees with their branches bent down with the weight of flowers and fruits, 
trees such as jambu trees, lime trees, plantains, areca palms, jack trees, date- 
palms, hintala trees, palyras, punnaga trees and mango trees—the various 
quarters of which were filled with the many sounds of crowds of parrots and 
cuckoos near found springs containing lotuses with bees roaming about 
them—a number of weary travelers entered with joy. 

There were 63 equal heaps of plantain fruits put together with 7 more of 
the same fruits. These were distributed equally among the 23 travelers so as 
to leave no remainder. Tell me now the number of fruits in each heap. 
(Mahaviracarya, A.D. 850.) 


TO THE THEORY 


INTRODUCTION / 3 
OF CONGRUENCES 


3.1 / INTRODUCTION 


Many problems in number theory lead to the consideration of a set of 
integers in which any two elements differ by a multiple of a fixed integer. 
For example, the general solution of the diophantine equation 14x + 10y = 12 
is xX = —2+ 5k, y =4-— 7k. Thus, the value of x in a solution must come 
from the set of all integers of form —2 + 5k; that is, the set 


{-.-, —12, —7, —2, 3, 8, 13, ---} 
and the value of y must come from the set 
{---, —10, —3, 4, 11, 18, 25, ---} 


In the first set any two elements differ by a multiple of 5, in the second set, 
any two elements differ by a multiple of 7. 

The study of sets of this type is facilitated by the concept of congruence, 
due to C. F. Gauss (1777-1855). We will develop parallels between the 
concepts of congruence and equality and then return to our study of these sets. 


m DEFINITION / Let a, b, and m be integers, m>0O. We say that a is 
congruent to b, modulo m (a = b (mod m)) if m divides a — b. 


For example, 6 = 13 (mod 7) and —23 =7 (mod 15). In the example 
preceding the definition the first set of integers contains all numbers con- 
gruent to —2 (mod 5) and the second set all numbers congruent to 4 
(mod 7). 

The similarity between the symbols for equality and congruence is not 
accidental. We will prove that congruence (mod m) has many properties 
similar to those of equality. The notation is important for two basic reasons: 
(1) It enables us to solve certain problems involving divisibility by the use 
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of algebraic methods; and (2) the notation suggests certain problems that 
probably would not have been formulated otherwise. 

The first theorem states that congruence (mod m) is an equivalence 
relation.’ 


= THEOREM 1 / Let m be a positive integer. 
(a) If a is an integer, then a = a (mod m). 
(b) If a= b (mod m), then b = a (mod m). 
(c) If a=b and b=c (mod m), then a= c (mod m). 


PROOF OF THE TRANSITIVE PROPERTY: Since a = b and b = c (mod m) 
then a — b and b—c are multiples of m. It follows that the sum of these 
numbers, a — c, is a multiple of m. Thus, a = c (mod m). 

The proofs of the other two properties are left for the reader (Exercise 2). 


EXERCISES 


1. Which moduli are used to measure the following times? 
(a) The days of the week. 
(b) The hours of the day. 
(c) The minutes after the hour. 


2. Prove the reflexive and symmetric properties of congruence (mod m). 


3. Which of the following are equivalence relations? If a relation is not 
an equivalence relation, which properties does it lack? 


(a) The relation “‘less-than-or-equals”’ for integers. 
(6) The relation “is a descendant of ” for persons. 
(c) The relation “is congruent to”’ for triangles. 
(d) The relation “‘is complementary to”’ for angles. 
(e) The relation “‘a|b”’ for integers. 


1A relation on a set S is a collection R of ordered pairs of elements of S. If 
the ordered pair (a, 5) is in the relation R, it is customary to write aRb rather 
than (a, b) Ee R. 

An equivalence relation on S is a relation R, having the properties: 

Reflexive Property: aRa for each a in S; 

Symmetric Property: If afb, then bRa; 

Transitive Property: If aRb and bRc, then aRc. 

Examples of equivalence relations are: (1) equality of numbers; (2) similarity 
of triangles; (3) concentricity of circles. 
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PROBLEMS 


1. Discuss the concept of congruence, modulo zero. 


2. Let m be a positive integer. Suppose that 
a,=mq,t+r, O<r,<m 
and ay =mq, +1 O<r,<m 


Prove that a, = a, (mod m) if, and only if, r; = r.. 


3.2 / ARITHMETIC PROPERTIES OF CONGRUENCE 


We have shown that congruence (mod m) has properties similar to equality. 
We will now prove that we can add and multiply congruences in a manner 
similar to that in which we operate using equality. 


m THEOREM 2/ If a=a’ (mod m) and b=D’ (mod m), then a+b= 
a’ + b’ (mod m) and ab = a’b’ (mod m). 


PROOF OF THE MULTIPLICATION PROPERTY: Because a = a’ and b= b’ 
(mod m), there exist integers k and n such that 


a—a =km b—b' =nm 
Multiplying, we obtain 
ab =(a’' + km)(b’ + nm) = a'b! + m(a’n + b'k + knm) 
Thus, m|(ab — a’b’) and so 
ab = a'b’ (mod m) 


The proof of the addition property is left for the reader. 

One of the important properties of the integers is the cancellation law for 
multiplication: If ab = ac and a #0, then b = c. A natural conjection would 
be that a corresponding cancellation law holds for the integers under con- 
gruence; that is, /f ab = ac (mod m) and a #0 (mod m), then b = c (mod m). 
It is easy, however, to construct an example to show that this cannot be 
correct. For example, 8-3 = 8:5 (mod 16) but 3 45 (mod 16). We see then 
that a cancellation law for multiplication must be more complicated when 
we are concerned with congruence rather than with equality. The correct 
form of the cancellation law is given in the following theorem. 
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= THEOREM 3/ (The Cancellation Law) If ab = ac (mod m), then b=c 
(mod m/d), where d = (a, m). 


PROOF: We divide a and m by d writing 
a=a'd,m=mi'd where (a’, m’) = 1 (see Corollary 4.3 of Chapter 1) 
Because ab = ac (mod m), we can write 
ab —ac=km 
Substituting from above and cancelling d, we obtain 
a’'d(b — c) = km'd 
a’(b — c) =km' 


Thus, m'|a'(b — c) and (m’, a’) =1. From Corollary 4.4 of Chapter 1, it 
follows that m’, equal to m/d, divides b — c. Thus, 


b = c (mod m/d) 


In one important special case we obtain a cancellation law similar to the 
one under equality. 


m= COROLLARY 3.1 /If (a4,m)=1 and ab=ac (mod m), then b=c 
(mod m). 


EXERCISES 
1. Prove: If a= b (mod m) and n is a positive integer, then a” = 5” (mod m). 
Show by an example that the converse is not necessarily true. 


2. Prove: If f(x) is a polynomial with integral coefficients, and if a=b 
(mod m), then f(a) = f(b) (mod m). 


3. Let p be a prime number. Prove: If ab = ac (mod p) and a # 0 (mod p), 
then b = c (mod p). 


4, Let p be a prime number. Prove: If a? = 1 (mod p) then a= +1 (mod p). 


PROBLEMS 


‘* CASTING OUT NINES ”’ 


1. If n is a positive integer, prove that n is congruent to the sum of its digits 
(written in base 10), modulo 9; that ts, if 


n = 4,10" + a, _ 10*-} + oie +a, 10 + a 
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then N=Q,+4~1 +++ +a, + dap (mod 9) 


[This fact was used extensively for checking numerical calculations before the 
invention of the desk calculator. For example, in order to check the compu- 
tation when multiplying 8207 by 1530, we proceed as follows: 


8207 = 8+ 2+7= 8 (mod 9) 
1530 = 1+5+3=0 (mod 9) 
12,556,710 =14+2+5+5+6+7+1=0(mod 9). 


Since 8:0 =0 (mod 9) the answer checks. Note that this does not tell us 
that the answer is correct. It only tells us that if we have a mistake the error 
is a multiple of 9. In other words, we can expect the method to detect errors 
86 of the time.] 


2. Obtain a rule similar to “‘casting out nines” for the modulus 11. 


3. Generalize the results of Problems 1 and 2 for an arbitrary base g. 


3.3 / THE LINEAR CONGRUENCE ax = b (mod m) 


Consider the linear congruence ax = b (mod m). By a solution of this 
congruence we mean an integer x, such that ax9 = b (mod m). The central 
problems of this section will be to decide when the above congruence has a 
solution, and to determine all solutions. 

Suppose Xp is a solution of the above congruence. Then there exists an 
integer yo such that 


aX = b — Yom 


Thus, adX9 — Myo = b, so that Xo, Yo iS a solution of the linear diophantine 
equation 


ax — my =b 


On the other hand, if the linear diophantine equation ax — my = b has a 
solution X9, Yo, then 


aX) — MYo = b 
and so aX, = b (mod m) 


Thus, we have proved the following important fact: The linear congruence 
ax = b (mod m) has Xo as a solution if, and only if, there is a integer yo such 
that Xo, Vo is a solution of the linear diophantine equation ax — my = b. 
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We are now in a position to state the basic theorem on linear congruences 
by paraphrasing Theorems | and 2 of Chapter 2. 


= THEOREM 4/ The linear congruence ax = b (mod m) has a solution if, 
and only if, the greatest common divisor of a and m divides b. If this is the 
case, and if x, is a solution of the congruence, then the general solution is 


m 
= d — 
x Xo( mo “} 
where d = (a, m) 


The details of the proof are left for the reader. 
Example I. 


Because (42, 91) = 7, the linear congruence 
42x = 50 (mod 91) 


does not have a solution. 
Example 2. 
Solve 42x = 50 (mod 76). 


SOLUTION: Because (42, 76) divides 50, the congruence has a solution. From the 
cancellation law we obtain 


21x = 25 (mod 38) 
Thus, 
21x = 25 = 25 + 38 = 63 (mod 38) 
Dividing by 21 (the g.c. d. of 21 and 38 is 1) we obtain 
x = 3 (mod 38) 


which is the general solution. If we wish to write the solution in terms of the original 
modulus, we have 


x=3 or x = 41 (mod 76) 
In some cases we wish to write the general solution of the congruence 


ax = b (mod m) in terms of the modulus m. From Theorem 4, we see that 
the general solution is x = Xp (mod m/d); thus, x is a solution if and only if 
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x 1s in the set 


m 
>Xg9 ——,X9,Xo + 


m -~mM 
d 7 X42, -| 


Pumped t aiak 


d 


It is easy to see that every number in this set is congruent to one of the 
numbers 


m m 
Xo. Xo ay ***, Xo +(d— Ia 


modulo m, and that these numbers are incongruent to each other. Thus, we 
obtain the following corollary. 


wm COROLLARY 4.1 / If x9 is a solution of ax = b (mod m) and if d = (a, m), 
the numbers 


m m 
Xo, Xo pare ve" Xo +(d—- 1) 


form a complete set of incongruent solutions, modulo m; that is, the numbers 
are incongruent (mod m), they are all solutions, and any solution is congruent 
to one of the numbers in the set (mod m). 


m COROLLARY 4.2 / Let d= (a, m). If d|b, the linear congruence ax = b 
(mod m) has exactly d incongruent solutions, modulo m. 


EXERCISES 


1. Solve each of the following linear congruences. If a solution exists write 
the general solution in terms of the original modulus. 


(a) 23x = 41 (mod 52) 
(b) 68x = 100 (mod 120) 
(c) 42x = 37 (mod 63) 
(d) 70x = 50 (mod 90) 
2. Construct a formal proof of Theorem 4. 


3. Let x9 be a solution of ax = b (mod m) and let d =(a, m). Prove that 
the numbers 


m m 
Xo, Xo er “t's XO + (d— a 


t 
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are solutions of the congruence, that they are incongruent, modulo m, and 
that any solution of the congruence is congruent to one of these numbers, 
modulo m. 


PROBLEM 


1. Discuss the linear congruence ax + by = c (mod m). Generalize the result 
to the congruence a, xX; + a, X,+-::+4a,x,=c (mod m). (For a related 
problem, see Problem 2 of Section 2.2.) 


3.4 / RESIDUE CLASSES 


= DEFINITION / A set of integers containing exactly those integers which 
are congruent, modulo m, to a fixed integer is called a residue class, modulo m. 


Example. 


There are seven residue classes, modulo 7, namely the sets: 


{---, —14, —7,0, 7, 14, 21, -- 
{---, —13, —6,1, 8, 15, 22, -- 
tee05- = 12, -=9, 2; 9,16, 23,=: 
{---, —11, —4, 3, 10, 17, 24, -- 
{---, —10, —3, 4, 11, 18, 25, -- 
ft <9, = 2,9; 12, 19,26, 
{---, —8, —1, 6, 13, 20, 27, --- 


e e e eo ° 


The first set contains all those integers congruent to 0 (mod 7), the second set, 
those congruent to 1, ---, the seventh set those congruent to 6 (mod 7). 


We will use the symbol #, (mod m) to represent the residue class, modulo m, 
that contains the integers congruent to a, modulo m. Thus, each residue 
class can be denoted by an infinity of symbols. For example, the first residue 
class above can be represented by any of the following symbols: ---, B_,4, 
R41, Ro, By, Bray °° 

Several of the elementary properties of residue classes follow from 
Theorem 1. They are given in the following theorem. 


= THEOREM 5/ Let.m be a positive integer. 

(a) 2, = &,, if and only if a = b (mod m). 

(b) Two residue classes (mod m) are either disjoint or equal. 

(c) There are exactly m distinct residue classes (mod m) and they contain 
all of the integers. 
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PROOF: (a) If a= b (mod m), it follows from the transitive property of 
congruence that an integer is congruent to a (mod m) if, and only if, it is 
congruent to b (mod m). Thus, Z, = &,,. 

Suppose #, = &,. Because a is in Z,, ais in @,. Thus, a = b (mod m). 

(b) Suppose &, and &, have an element c in common. Then c = a (mod m) 
and c = b (mod m). From the symmetric and transitive properties of con- 
gruence, it follows that a = b (mod m). From (a) it now follows that 2, = &,. 
Thus, either #, and &, are disjoint or they are equal. 

(c) If a is an integer, we can divide a by m writing 


a=mqt+r O<r<m 


Thus, a=r (mod m) and so &, = &,. This implies that a is in one of the 
residue classes Bo, Z;, ---, Am — 1. Because the integers 0, 1, 2, ---, m— 1 are 
incongruent (mod m), it follows that there are exactly m residue classes 
(mod m). 


Using the residue class concept we can reformulate our results on the 
linear congruence ax = b (mod m). Because the general solution of the 


congruence is 
m 


we see that x is a solution if, and only if, x is in Z,,, (mod m/d). Using Corol- 
lary 4.1, we find that x is a solution if, and only if, x is in one of the residue 
classes 


By Rx. + m/d> °°" > Ry. + (d — 1)m/d (mod m) 
Example. 


In Example 2 of Section 3.3 we found the general solution of 42x = 50 (mod 76) 
to be 


x = 3 (mod 38) 


Thus, x is a solution of the congruence if and only if x is in 23 (mod 38). 


EXERCISES 


1. (a) List six elements from each of the following residue classes modulo 17: 
R, Ryo, Rr, Rs, R_475 R _ 80. 


(b) Which of the above residue classes are equal? 
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2. Write the solutions of the linear congruences in Exercise 1 of Section 3.3 
using residue classes. 


PROBLEMS 


(Knowledge of elementary modern algebra is required.) 
THE RING OF RESIDUE CLASSES (mod m) 
We define the operations “ @ ”’ and “‘.’’ on the residue classes, modulo m, 
as follows: 
ZR, ® Ry a R, +b 
Rye ky =Bay 
For example, modulo 7, 
R,BB7 =f, —6, 1, 8, 15, ---} @{---, —4, 3, 10, 17, ---} 
a Rig = ents —3, 4, 11, 18, vee} 


1. Show that the two operations are “well defined,’’ that is, the operations 
depend only on the residue classes involved and not on the particular sub- 
scripts used to represent the residue classes. 


2. Show that the residue classes (mod m) form a commutative ring with 
identity under the operations “ @” and “.”’. 


3. Show that the ring of residue classes (mod m) is a field if, and only if, 
m is a prime number. 


3.5 / SYSTEMS OF LINEAR CONGRUENCES 


Consider the system of linear congruences 
a,x = b, (mod m,) 


a,x = b, (mod m,) 


a,x = b, (mod m,) 


By a solution of this system we mean an integer x, which is a solution of each 
congruence in the system. 
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Because each congruence in the above system determines a residue class, 
the solutions of the system are the numbers in the intersection of these 
classes. Thus, we may replace any one of the above congruences by a different 
congruence that defines the same residue class. 

In Part II of the solution of the following example, we demonstrate an 
efficient method for solving a system of linear congruences. 


Example I. 


Solve the system ew aod te , 


4x = 12 (mod 14) 


PART I: By the methods of Section 3.3, we see that xo is a Solution of 5x =7 
(mod 12) if and only if xo is in 4, (mod 12), and xo is a solution of 4x = 12 
(mod 14) if and only if xo is in #3 (mod 7). Thus, xo is a solution of the system of 
congruences if, and only if, xo is in the intersection of the two residue classes 
x = 11 (mod 12) 


2, (mod 12) and &3 (mod 7). Therefore, the system ‘ =3 (mod 7) 


has the 
same solutions as the original system. 


PART Il: Suppose Xo iS a Solution of the system. Then xo = 11 (mod 12), and so 
we can write 


Xo =11+ 12s 
Substituting this into the second congruence, we obtain 
Xo = 11+ 12s = 3 (mod 7) 
Thus, 
12s = —8 (mod 7) 
—2s = —8 (mod 7) 
s= 4(mod 7) 
If we now write s = 4+ 7t and substitute into the expression for xo, we obtain 
Xo = 11+ 12s =59-+ 841 
Thus, Xo = 59 (mod 84) 


We have shown that if a solution xo exists, it must be congruent to 59, modulo 84. 
If we now substitute back in the original system, we see that any number congruent 
to 59, modulo 84, is indeed a solution. Thus, the general solution is 


x = 59 (mod 84) 


We now prove the fundamental theorem for systems of linear congruences. 
This theorem and its corollary give necessary and sufficient conditions which 
insure that a system has a solution and determine the form of the general 
solution. 
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x = a(mod m) 


=» THEOREM 6/ The system of congruences x = b (mod n) 


has a solu- 
tion if, and only if, (m, 2)|(a — b). If this is the case and if x, is a solution, 
the general solution is 
X = X_ (mod [m, n]) 
PROOF: (Part I) The integer x is a solution of the system of congruences 

if, and only if, there exists an integer k such that 

Xp =at+km 
and a+km= b (mod n) 
By Theorem 4 such a number &k will exist if, and only if, 

(m, n)|(a — b) 


(Part II) Suppose that (m, n)|(a — b) and that x, is a solution of the system 
of congruences. If x, is also a solution, then 


Xx; = a= Xp (mod m) 
and x, = b= Xo (mod n) 


Thus, x; — Xo is a common multiple of m and n and, by Theorem 5 of 
Chapter 1, [m, n]|(x, — x9). In terms of congruences, 


Xx, =X, (mod [m, n)) 
Conversely, if x, = Xo (mod [m, n]), then obviously 
Xx, = Xo =a (mod m) 
and Xx; = Xo = b (mod n) 
and so x, is also a solution of the system. Thus, the general solution is 
xX = X_ (mod [m, n]) 
Several results can be proved for systems of more than two congruences. 
Two corollaries will be stated, the proofs will be left for the reader. 
=» COROLLARY 6.1 / The system of linear congruences 
x =a, (mod m,) 


x =a, (mod m,) 


x =a, (mod m,) 
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has a solution if, and only if, (m,, m,)|(a; — a,), for each pair of subscripts /, k. 
If this is the case and if x9 is a solution, the general solution is 


X = Xo (mod [m,, My, °°", m,)) 


For the special case in which the moduli m,, m,, ---, m, are relatively prime 
in pairs, it is obvious that the system in Corollary 6.1 will have a solution. 
In this case we can write the solution of the system in closed form. This fact 
was known to the Chinese mathematician Sun-tse (first century A.D.) and 
the corresponding theorem is known as the Chinese Remainder Theorem. 


=m COROLLARY 6.2 /(The Chinese Remainder Theorem.) Suppose 
M,, M2, ***, M, are pairwise relatively prime, that is, (m;, m,) = 1 if i #7. Let 
M =m, mz, ---m,. We define numbers 5,, b2, ---, b, by choosing y = b; as a 
solution of 


M 
7 = 1 (mod m)) (j= 1, 2, ++, n) 


J 
The general solution of the system 
x =a, (mod m,) 


x = a, (mod m,) 


x =a, (mod m,) 


M M M 
x= a,b, — + a,b, = + ao + a,b, a (mod M) 
Mm, My M,, 


Corollary 6.2 furnishes the most efficient method for solving several 
systems of congruences when the same sets of moduli appear in each system. 


Example 2. 


Solve the systems of congruences 


x = 5 (mod 7) x= 4 (mod 7) x= 1 (mod 7) 
x = 8 (mod 19)’ |x = 16 (mod 19)}’ |x = —8 (mod 19) 
x = 2 (mod 7) 
and 
x = 5 (mod 19) 
Because y = b, is a solution of 19y = 1 (mod 7), we may choose 5; = 3. Similarly, 

we choose b2 = —8. Then 

M 

b, — =3-19=57 
m 


1 
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M 
and b, — = —8:7= —56 
m 


2 


We now find the solutions of the systems by substituting into the congruence 
in Corollary 6.2: 


System I: x =5-57+ 8(—56) = 103 (mod 133). 
System II: x = 4-57+ 16(—56) = 130 (mod 133). 
System IIIT: x = 1-57 — 8(—S5S6) = 106 (mod 133). 
System IV: x =2°57-+ 5(—56) = 100 (mod 133). 


EXERCISES 


1. Calculate the intersection of the residue classes: 
(a) BZ, (mod 15) qn &, (mod 21) nN &,; (mod 37), 
(b) 2,2 (mod 26) nN &,g (mod 21) nN &, (mod 36). 


2. Solve: (a) (x = 2 (mod 5)) (6) {x = 1 (mod 5)) (c) {x = 4 (mod 5) 

f = 3 (mod 7 (* = 4 (mod 7 @ = 1 (mod 4 
3. (Ancient Hindu Problem) If eggs are taken from a basket two, three, four, 
five, and six at a time there are left over, respectively, one, two, three, four, 
and five eggs. If they are taken out seven at a time, there are no eggs left over. 
How many eggs are in the basket? 


4. (Ancient Chinese Problem) A number of units of work are to be per- 
formed by workmen. If sets of 2, 3, 6, and 12 men work for certain numbers 
of whole days there remain, respectively, 1, 2, 5, and 5 units of work not 
completed. How many units are there? 


PROBLEMS 


1. Prove Corollary 6.1. 
2. Prove Corollary 6.2. 


3.6 / INTRODUCTION TO HIGHER ORDER CONGRUENCES 


The fundamental problem in the theory of equations is that of finding all 
solutions of f(x) = 0, where f(x) is a polynomial. A related number theory 
problem is that of finding all solutions of 


JS (x) = 0 (mod m) 
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where f(x) is an integral polynomial, that is, a polynomial with integral 
coefficients. 

If a is a solution of f(x) =0 (mod m), then any integer congruent to a, 
modulo m, is also a solution. Therefore, our problem is to find all incongruent 
solutions of f(x) = 0 (mod m). In general, this problem is very difficult and 
many of the techniques of solution depend partially on trial-and-error 
methods. In this section we will prove some general theorems concerning 
these solutions. 

Our first result is a theorem which shows us that if m is not a power of a 
prime, the problem of solving f(x) = 0 (mod m) can be reduced to that of 
solving a system of congruences in which the moduli are smaller than m. 


= THEOREM 7/ Let M=m,:m, --- m,, where the integers m,, mo, ---, m, 
are relatively prime in pairs. The integer a is a solution of 


f(x) = 0 (mod M) 
if, and only if, a is a solution of the system 
f(x) = 0 (mod m,) 
I(x) = 0 (mod m;,) 


f(x) = 0 (mod m,) 


PROOF: If f(a) =0 (mod M), then obviously f(a) =0 (mod m,) G = 1, 
2, ---, n). Conversely, suppose a_is a solution of the system 


(x) = 0 (mod m;) (i = 1, 2, ---, n) 


Then f(a) is a solution of the system 


y = 0 (mod m,) 
y = 0 (mod m,) 
y = 0 (mod m,) 


and it follows from Corollary 6.2 that f(a) =0 (mod m, -: mz, --- m,). Thus, 
ais a solution of f(x) =0 (mod M). 


Example I. 
Let f(x) = x5 — 3x* + 7x3 — 2x? — 9x + 6. Solve f(x) = 0 (mod 165). 


SOLUTION: Since 165 = 11-5 - 3, we first find all solutions of f(x) = 0 (mod 11), 
f(x) = 0 (mod 5), and f(x) = 0 (mod 3). 
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By substituting the values x = 0, 1, 2, ---, 10, we find the solutions of f(x) = 
(mod 11) to be 


x=1 x=6 and x = 8 (mod 11) 
Similarly, the solutions of f(x) = 0 (mod 5) are 
x=1 x=2 and x = 3 (mod 5) 
and the solutions of f(x) = 0 (mod 3) are 
x=0 and x = 1 (mod 3) 


Consequently, there are 18 solutions, corresponding to the 18 systems of 
linear congruence that we can write with one member from each of the above 
three sets of “‘ partial solutions.”’ 

For example, from the systems 


x = 1 (mod 11) x = 6 (mod 11) 
x =1 (mod 5) and x =3(mod 5) 
x =0(mod 3) x=1(mod 3) 


we obtain, respectively, the solutions x = 111 (mod 165) and x = 28 (mod 
165). 

One of the basic theorems in the theory of equations is the factor theorem. 
We will prove a corresponding theorem for congruences and use it to deter- 
mine the maximum number of incongruent solutions of a polynomial con- 
gruence with a prime modulus. 


= THEOREM 8/ (The Factor Theorem.) Let f(x) be an a integral poly- 
nomial of degree n > 0. The integer a is a solution of 
f(x) = 0 (mod m) 


if, and only if, there exists an integral polynomial g(x) of degree n — 1 such 
that 
F(x) = (x — a) + g(x) (mod m) 
PROOF: If f(x) = (x — a) + g(x) (mod m) 
then f(a) = (a — a) - g(a) = 0 (mod m) 


To prove the converse, we divide f(x) by x — a, obtaining a quotient poly- 
nomial g(x) and a constant remainder r. Obviously, g(x) has integral co- 
efficients and is of degree m — 1, while r is an integer. Thus, 


f(x) = (% — a) + g(x) +r 


52 m INTRODUCTION TO THE THEORY OF CONGRUENCES 


If we substitute x = a, we obtain 
f(a) =(a— a) g(a) +r=r(mod m) 


Because a is a solution of f(x) =0 (mod m), we also know that f(a) =0 
(mod m). Thus, r = 0 (mod m), implying that 


I(x) = (x — a) g(x) (mod m) 


The next two theorems are only valid if the modulus is a prime. (Example 1 
furnishes a counterexample for the two propositions if we allow the modulus 
to be composite.) 


m THEOREM 9 / Let p be a prime number and f(x) an integral polynomial 
of degree n. If a,, az, ---, a, are incongruent solutions of f(x) = 0 (mod p), 
there exists an integral polynomial g,(x) of degree n — t such that 


F(X) = (% — ay)(% — a2) +++ (% — a,)g,(x) (mod p) 


PROOF: By Theorem 8 there is an integral polynomial g,(x) of degree 
n — 1 such that 


F(x) = (* — ay) + 9i(x) (mod p) 

Since a, 1s a solution of f(x) = 0 (mod p), then 
F (42) = (a2 — 4) + 9;(a2) = 0 (mod p) 
Since a, — a; #0 (mod p), then 
g,(a2) = 0 (mod p) 

We now apply Theorem 8 again and write 

gi(x) = (X% — az) + g2(x) (mod p) 
and substitute, obtaining 

f(x) = (& — a1) + (% — a2) + g2(x) (mod p) 

where the degree of g,(x) is n — 2. 


If we repeat this argument using successively a3, a4, ---, a, we eventually 
obtain | 


F(x) = (% — a1)» & — a2) +» (& — 4,) + g(x) (mod p) 


where g,(x) is an integral polynomial of degree n — ¢. 


= THEOREM 10/ (The Theorem of Lagrange.) Let p be a prime number, 
and let f(x) be an integral polynomial of degree n. The congruence f(x) = 0 
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(mod p) has at most n incongruent solutions, modulo p, unless each co- 
efficient of f(x) is congruent to zero, modulo p. 


The proof, which follows immediately from Theorem 9, is left for the 
reader. 


EXERCISES 


1. Find all 18 solutions of the congruence in Example 1. (As mentioned 
earlier, the most efficient way to work problems of this type is by using 
Corollary 6.2.) 


2. Solve the following congruences: 
(a) x° — 2x* — 5x + 10 = 0 (mod 17) 
(b) x? — 11 =0 (mod 23) 
(c) x? + 11 = 0 (mod 23) 


3. Solve: 2x? — 25x? + 9x + 1 = 0 (mod 715). 


PROBLEMS 


1. Where does the proof break down if we try to prove Theorem 9 for a 
composite modulus, using the same argument as for a prime modulus? 


2. Prove Theorem 10. 


3. Let f(x) be an integral polynomial. For each positive integer k, let a(k) 
denote the number of incongruent solutions of 


J (x) = 0 Gmod k) 
Prove: If m,, m,, ---, m, are pairwise relatively prime integers, then 
a(m,m, --- m,) = a(m,) am) --- a(m,) 
4. Let f(x) be an integral polynomial of positive degree. Prove that 
FQ), £2), + £m), + 


are not all prime numbers. 


APPENDIX / A CALENDAR PROBLEM 


In this section we will derive a formula in which the day of the week is 
expressed as a function of the calendar date. 
In the Gregorian calendar a normal year consists of 52 weeks and one 
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day, while in a leap year an extra day is added at the end of February. 
Recall that a noncentury year is a leap year if it has a number divisible by 4, 
but a century year is a leap year only when it has a number divisible by 400. 
Thus, 1600 was a leap year, but 1700, 1800, and 1900 were not. 

We will make two restrictions in order to simplify our formula. (1) Because 
a century leap year occurred 18 years after the calendar reform in 1582 we 
will find it convenient to develop a formula only for the years subsequent 
to 1600. (2) In order not to add the extra day during the course of the leap 
year, we will start the new year on March 1. Thus, for purposes of our 
computation, February 29, 1872 is considered to be the last day of the 
twelfth month of 1871. 

To reduce the problem to one involving numbers, we assign the numbers 
1, 2, ---, 12 to the months March, April ---, February, and the numbers 
0, 1, ---, 6 to the days Sunday, Monday, ---, Saturday, respectively. 

Let us suppose that March 1, 1600 fell on day of the week number a. Then 


March 1, 1601 fell on day a + 1 (mod 7) 
March 1, 1602 fell on day a + 2 (mod 7) 
March 1, 1603 fell on day a + 3 (mod 7) 
March 1, 1604 fell on day a + 4 + 1 (mod 7) 


In general, March 1 of the year 1600 + ¢ falls on day number d, where 
d=a+t+(the number of elapsed leap years) (mod 7) 


It is not difficult to compute the number of elapsed leap years. If t < 100, 
this number is simply [7/4]. If t = 100, it is the number [¢/4], minus the 
number of elapsed century years, [t/100], p/us the number of elapsed century 
leap years, [t/400]. (These numbers would be much more difficult to calculate 
if we did not start with a century leap year.) 

Therefore, March 1 of the year 1600 + ¢ will occur on the day of the 
week that is congruent, modulo 7, to 


“++[a]~[iee] + [a0] 


If the reader substitutes the value of ¢ obtained from the current year and 
sets this congruent to the day on which March | falls (found by checking a 
current calendar) he will find that a = 3 (mod 7). Therefore, March 1, 1600 
fell on a Wednesday. 

Our formula to determine d, the day on which March | falls in the year 
1600 + ¢ is thus 


an3+r+[{]-[4]+[g] moan 


We will now rewrite this formula so that we can substitute the year number 
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directly. We write the year number JN in the form 
N=100C+D 0< D< 100 
For the year N we obviously have 
t = N — 1600 = 100 C + D — 1600 
If we now substitute this value of ¢ into the above formula we obtain 


100 C+ D— a] 


d= 3+ (100 C + D— 1600) + | 4 


pees D-— _ r be D— 1007 a 
100 400 ? 


Using Theorem 8(a) of Chapter 1, we can rewrite this as 


D 
d=3+100C + D~ 1600 +25c+[ =] - 400 


D 100C + D 
-c-[ a] +16 +[ =] - 4 (mod 7 


Because 0 < D < 100, then [ D/100] = 0. It is easy to show that 


ec) -[9] 


Substituting these values into the above expression and simplifying, we 


obtain 
D 
d=3-2c+D+[5]+[ 2] (mod?) 


We now have a formula to compute the day of the week on which March 1 
falls in a given year. In order to extend this formula to cover other dates, we 
consider the following information: 


The number of April 1 is 3 units greater than that of March 1. 

The number of May 1 is 2 units greater than that of April 1. 

The number of June 1 is 3 units greater than that of May 1. 

The number of July 1 is 2 units greater than that of June 1. 

The number of August | is 3 units greater than that of July 1. 

The number of September | is 3 units greater than that of August 1. 
The number of October 1 is 2 units greater than that of September 1. 
The number of November | is 3 units greater than that of October 1. 
The number of December | is 2 units greater than that of November 1. 
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The number of January | is 3 units greater than that of December 1. 
The number of February 1 is 3 units greater than that of January 1. 


It was observed by Rev. Zeller that the function f(m) = [2.6m — 0.2] takes 
on the same increments as the above table when m varies from 1 to 12. 


When m=1, we obtain f(m) =[2.6 —0.2] =2. Thus, the formula for 
March | becomes 


C D 
d=1+ (2.6 -0.2]-2c+ D+[F] + [7] (moa 7 
and the first day of the mth month falls on day 
C D 
d’=1+ [2.6m —0.2] —-2C+D+ [=] + [=] (mod 7) 


From this it is obvious that the rth day of the mth month of year 100C + D 
falls on day number 


d=r+ [2.6m —0.2] —2C+ D+ [S| + [z] (mod 7) 


Example. 
Determine the day of the week on which January 17, 1932 fell. 


SOLUTION: For the purposes of our computation, January 1932 is considered to 
be the eleventh month of 1931. Thus, 


r=17 m= 11 C= 19 D=31 


From the formula we obtain 


d= 174+ [(2.6)(11) —0.2] —2-19+31+ [=] + [=] 


= 17+ 28 — 38+ 31+4+7 =0 (mod 7) 
Thus, January 17, 1932 was a Sunday. 


EXERCISES 


1. Determine the day of the week on which you were born. 


2. Show that 
100C + D 
[a0 100 —-|- E ; | ito <0 < 100 


THE EULER-FERMAT 
THEOREM 


41/ COMPLETE SYSTEMS OF RESIDUES 


The most important function considered in elementary number theory is 
Euler’s totient function g.’ In this chapter we will develop certain properties 
of this function, using the concepts of complete and reduced systems of 
residues. 


a DEFINITION / Let m be a positive integer. A set of integers is called a 
complete system of residues, modulo m, if the set contains exactly one element 
from each residue class, modulo m. 


Example. 


Let m = 7. Because 1 is in %,, 16is in #22, —11 isin 43, 53 is in 24, —44 is 
in 2s, 69 isin Z, and —77 is in &, then 


{1, 16, —11, 53, —44, 69, —77} 


is a complete system of residues (mod 7). 


Because there are exactly m distinct residue classes (mod m), namely 
Ro, Bi, BR, +--+, Am —1, then a complete system of residues must contain 
exactly m elements. The simplest complete system of residues is, obviously, 
0,1,---,;m—1. 

The first theorem states a different set of conditions under which a set is 
known to be a complete system of residues. This is important, because the 
conditions in the theorem are usually easier to check than are the conditions 
in the definition. 


1Buler’s function will be defined in Section 4.2. 
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m= THEOREM 1 / Let m be a positive integer and let S be a nonvoid set of 
integers. S is a complete system of residues (mod m) if, and only if, 

(1) S contains m elements, and 

(2) no two elements of S are congruent, modulo m. 


PROOF: Obviously, if S is a complete system of residues, the two conditions 
are satisfied. 

To prove the converse, we note that if no two elements of S are congruent, 
the elements of S are in different residue classes (mod m). Since S has m 
elements, all the residue classes must be represented among the elements 
of S. Thus, S is a complete system of residues (mod m). 


PROBLEMS 
1. Let m be a positive integer and let r be relatively prime to m. Show that 
the finite arithmetic progression 
{a,a+r,a+t+2r,--,a+(m—1)-r} 
is a complete system of residues (mod m). 


2. Suppose (m, n) = 1. Let {a,, a5, ---, a,,} be a complete system of residues 
(mod m), and let {b,, b5, ---, b,} be a complete system of residues (mod n). 
Let S be the set of all integers of form 

an-+b,jm @=1, 2 yan, J = 1,208) 


Show that S is a complete system of residues (mod mn). 


4.22 / REDUCED SYSTEMS OF RESIDUES 


Corollary 4.6 of Chapter 1 can be restated using the congruence notation 


as follows: If 
a = b (mod m) 


then (a, m) = (b, m) 


In other words, if a and b are in the same residue class (mod m) then 
(a, m) = (b, m). This leads us naturally to the following definition: 


ms DEFINITION / Let &, be a residue class (mod m). We say that &, is 
relatively prime to m provided each element of &, is relatively prime to m. 

This is equivalent to saying that @, is relatively prime to m if, and only 
if, (a, m) = 1. 
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Example |. 


R,, Rs, R1, and Z (mod 10) are the residue classes that are relatively prime 
to 10. 


Euler’s totient function @ is defined as follows: 


m DEFINITION / Let m be a positive integer. p(m) is the number of residue 
classes (mod m) that are relatively prime to m. 


From the above example, we see that (10) = 4. Similarly, since 2,, #3, 
Rio, R_5, #,,, and #,; (mod 14) are the residue classes relatively prime 
to 14, we see that g(14) = 6. Obviously, g(1) = 1 since | is relatively prime 
to the single residue class (mod 1). 


m DEFINITION / A nonvoid set of integers is called a reduced system of 
residues, modulo m, if the set contains exactly one element from each residue 
class (mod m) that is relatively prime to m. 


Example 2. 


Since 1 is in 2:, —17 isin &s3, 87 is in &;, and 19 is in Z> (mod 10), then 
{1, —17, 87, 19} 
is a reduced system of residues (mod 10). Similarly, 
{1, 3, 19, —5, 11, 23} 


is a reduced system of residues (mod 14). 


One method to obtain a reduced system of residues is to start with a 
complete system of residues and delete those elements that are not relatively 
prime to the modulus. Thus, the simplest reduced system of residues, 
modulo m (m > 1), is just the collection of all integers in the set {1, ---, m — 1} 
that are relatively prime to m. 

We will now prove a theorem similar to Theorem | that gives us an alternate 
set of conditions under which a set of integers is known to be a reduced 
system of residues (mod m). In actual practice the conditions in the theorem 
are usually easier to check than are the conditions in the definition. 


= THEOREM 2/ Let m be a positive integer. A set of integers S is a 
reduced system of residues (mod m) if, and only if, 

(1) S contains exactly g(m) elements, 

(2) no two elements of S are congruent (mod m), 
and (3) each element of S is relatively prime to m. 
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PROOF: It is obvious from the definition that a reduced system of residues 
must satisfy all three conditions. 

To prove the converse we suppose that S is a set of integers having the 
three properties. Because no two elements of S are congruent, the elements 
are in different residue classes (mod m). Since the elements of S are relatively 
prime to m, they are in residue classes that are relatively prime to m. Thus, 
the g(m) elements of S are distributed among the g(m) residue classes that 
are relatively prime to m, one in each residue class. Therefore, S is a reduced 
system of residues, modulo m. 


The conditions established in Theorem 2 are frequently easy to check 
when considering particular examples. As a simple application we state the 
following corollary (the proof is left for the reader): 


m COROLLARY 2.1 / If {r;, 72, +++, rocmy} is a reduced system of residues 
(mod m), and if (a, m) = 1, then {ar,, ar, «++, Ar gcmy} is also a reduced system 
of residues (mod m). 


EXERCISE 


1. Prove Corollary 2.1. 


PROBLEM 


1. Let mand n be relatively prime. Let {@,, a2, «++, Agmy} and {b,, bo, +++, Docny} 
be reduced systems of residues, modulo m and n, respectively. Let S be the 
set of all integers of form 


ain + bjm (i = 1, 2, «+, p(m); j = 1, 2, «+, P()) 


Prove that S is a reduced system of residues (mod mn). (This result is 
needed for Problem 1 of Section 4.3.) 


4.3 /EVALUATION OF EULER’S TOTIENT FUNCTION 


= DEFINITION / A function « defined on the positive integers is said to be 
multiplicative 1f «(mn) = a(m) + a(n) whenever m and n are relatively prime. 


For example, the function a, defined in Problem 3 of Section 3.6 is a 
multiplicative function. The function that is identically zero 1s trivially a 
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multiplicative function. The function f, defined by B(n) = n, is a multipli- 
cative function. 

As a first step in obtaining a formula for y(n), we shall prove that 9 is a 
multiplicative function. Because it is easy to establish that o(4) =2 and 
(5) = 4, we can then calculate that @(20) = ~(4) - v(5) = 8, so that a reduced 
system of residues (mod 20) contains eight elements. Note that the fact that 
gy is multiplicative does not imply that (20) = g(2) -: g(10). (Why not?) 


= THEOREM 3/ The function 9 is multiplicative. 


PROOF: Let m and n be relatively prime positive integers and let 
{1, Az, ***, Ag¢my} and {b,, bo, +++, Dyyy} be reduced systems of residues, modulo 
m and n, respectively. We will calculate a set of numbers {c;,} as follows: For 
each pair i and j where 1 <i < g(m), 1 <j< p(n), V we let c,;; be the unique 
solution of the system of congruences 


x =a; (mod m) 
x = 5b, (mod n) 


which is in the range 1 < x < mn (see Theorem 6 of Chapter 3). Thus, we 
obtain the set of g(m) - y(n) numbers listed in Table 4.1. 


Table 4.1 
b, b, _ b, = bm 
ay C11 C12 A C1j ee C1 gn) 
a> C21 C22 ais C2 j — C2 ,e(n) 
a; Cit Ci2 Cij oe Ci p(n) 
29m) Co(m),1 Co(m),2 me Co(m),j — Co(m),(n) 


If we can show that this set is a reduced system of residues, modulo mn, we 
will know that g(mn) = o(m): g(n). The proof of this fact will be in three 
parts. 

I. We will show that the c’s are incongruent (mod mn). Suppose c;; =C., 
(mod mn). Then 


A; = Cy; = Cy, = a, (mod m) 


which, because the a’s came from a reduced system of residues (mod m), 
implies that 7 = s. In a similar way we see that b; = b, (mod n) and, therefore, 
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j=t. Thus, ifi #¢ s or j # ¢, then c;; # c,, (mod mn). In other words, we have 
~(m) - p(n) incongruent numbers in the set {c;,;}. 
II. Next we will show that each c;, is relatively prime to mn. Since a; = ¢;; 
(mod m), then 
(c;;,m) =(a;,m) = 1 


and, similarly (c,;,n) = (b;, n) = 1 


Since c;,; is relatively prime to both m and n, then c,, is relatively prime to m. 

III. Let #, be a residue class (mod mn), that is relatively prime to mn. 
We will show that one of the c’s is in &,. Since &, is relatively prime to mn, 
the integer k is relatively prime to mn, and thus also to m and n. Since 
{1, ***, Ag¢my} 1S a reduced system of residues, modulo m, there is an element 
in this set, say a,;, that is congruent to k (mod m). Similarly, there is an 
element b, that is congruent to k (mod n). Thus, k is a solution of 


x =a, (mod m) 
x = b; (mod m) 


and so, by Theorem 6 of Chapter 3, k =c,; (mod mn). Thus, c;; is in the 
residue class Z, (mod mn). 

Hence, the numbers in the constructed set are all in different residue 
classes, and a residue class is represented in the set if, and only if, it is rela- 
tively prime to mn. Therefore, the set is a reduced system of residues, modulo 
mn; and @g(m)-: y(n), the number of elements in the constructed set, is equal 
to g(mn). 


Because ¢ is multiplicative, we will be able to evaluate the function if we 
can evaluate it for powers of each prime p. 


=» THEOREM 4/lIf p is a prime number and « is a positive integer, 
o(p*) = p* (1 — I/p) = p** -(p— 1). 


PROOF: Because the set of integers 


‘p} 


is a complete system of residues, modulo p*, we can obtain a reduced system 
by deleting from this set all elements that are not relatively prime to p*. 
Since p is a prime, the only numbers not relatively prime to p* are the mul- 
tiples of p. Thus, we must delete the p*~ ' numbers 


{l, 2. "Dy D + 1, ree, 2D, vee, 3p, vee, Pp 


ee: 


{D, 2p, 3p, ar P} 


Therefore, 


—_ tos a I 
p(p*) = p*— p* ' =p*" '(p-—l=p (1 --| 
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m COROLLARY 4.1 / If n =p," p,” --- p,’*, where the p’s are distinct 
prime numbers, then 


wera 2)(i-B) (tf 


The proof, which follows directly from Theorem 4 and the multiplicative 
property of @, is left for the reader. 


Example. 
Calculate g(7? - 13% - 19). 
SOLUTION: 9(7? - 13% - 19) = ¢(7?) - p(13*) - p19) (by Theorem 3) 
=7(7— 1): 133 13 —1)-d19 — 1) 
= 1,680,932 


EXERCISES 


1. Calculate @(9702) and (873). 

. Prove that o(n) is even if n > 2. 
. Prove Corollary 4.1. 

. Find all solutions of o(n) = 14. 


On -& WH WN 


. If w is a multiplicative function, not identically zero, prove that a(1) = 1. 


PROBLEM 


1. Use Problem 1 of Section 4.2 to devise a new proof that @ is multipli- 
cative. 


4.4/THE EULER-FERMAT THEOREM 


The significance of Euler’s totient function in elementary number theory 
rests primarily on certain of its properties that are superficially unrelated to 
the definition of the function. Two of these properties will be established in 
Theorem 5 and Theorem 7. 

A special case of Theorem 5 was proved by Fermat and the general theorem 
was later proved by Euler. We will reverse the historical order and prove 
the generalization first. (Also see Problem 3.) 
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m THEOREM 5 / (Euler—Fermat) If m is a positive integer and (a, m) = 1, 
then 
a?) = 1 (mod m) 


PROOF: Let {r,, 72, °**s gcmy} be a reduced system of residues (mod m). 
It follows from Corollary 2.1 that {ar,, arz, +++, Argcmy} is also a reduced 
system of residues (mod m). Thus, each ar; is congruent to one, and only one, 
of the elements in the original set. Therefore, the products of the elements 
in the sets are congruent; that is, 


(ar,)* (ar) °° (Argcmy) =11l2°** lon) (mod m) 


Because each r; is relatively prime to m, we apply the cancellation law for 
congruences to obtain a”) = 1 (mod m). 


=m COROLLARY 5.1 / (Fermat’s Theorem) If p is a prime number and a is 
an integer, then 
a? = a (mod p) 


PROOF: If (a, p) = 1, then o(p) = p — 1. By Theorem 5, 
1 (mod p) 


p-il 


a 
and so a? = a (mod p) 
If (a, p) #1, then a=0 (mod p). Thus, 
a? = 0 =a (mod p) 
Example. 
Calculate 7'°!° (mod 31). 
Because (31) = 30 and (7, 31) = 1, 
7°° = 1 (mod 31) 
Since 1015 = 30: 33 + 25, 
7iois = 730°334+25 = 133 3 725 = 725 (mod 31) 


To calculate 775 (mod 31) we first calculate 7" (mod 31) where n = 2, 4, 8, 16, 
obtaining 
7? = 49 = — 13 (mod 31) 


7*=169= = 14 (mod 31) 
7%?=196= 10 (mod 31) 
7'*° = 100 = 7 (mod 31) 
Thus, 775 = 7-78 -7!©=7-10:7 = 490 = 25 = —6 (mod 31). 
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EXERCISES 


1. Calculate 3'°°° (mod 7). 
2. Calculate 7/°°° (mod 54). 
3. Use Fermat’s theorem to prove: If p is a prime, then (a + b)? =a’ + b? 
(mod p). 
PROBLEMS 


1. If p is a prime and | < k < p — 1, show that the binomial coefficient (7) 
is divisible by p. 


2. Prove Fermat’s theorem by induction on a. [Hint: Expand @’= 
[((a — 1) + 1]? (mod p) by the binomial theorem (Problem 3 of Section 1.2) 
and use Problem 1.] 


3. (Knowledge of elementary modern algebra is required.) Prove that the 
Euler-Fermat theorem follows from the fact that the order of an element of 
a finite group divides the order of the group. 


4.5* / APPLICATION TO HIGHER ORDER CONGRUENCES 


Fermat’s theorem can frequently be used to obtain information about the 
solutions of the congruence 


f(x) = 0 (mod p) 


where f(x) is an integral polynomial. 
The most obvious use of the theorem is in reducing the congruence to one 
of lower degree, if f(x) has degree greater than or equal to p. 


Example |. 


Let f(x) = 3x'° — 7x? + 4x® — 3x7 + 2x® — x95 + x3 — 4x? — x43. 
Show that the congruence f(x) = 0 (mod 5) has the same solutions as 
3x? + x?+x+2=0 (mod 5) 


SOLUTION: If x is any integer, from Fermat’s theorem we obtain 


xw=x, x8 Hx*,x7 = x9, x8 = xt, x? =x, and x'° = x? (mod 5) | 
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Substituting these values into the congruence f(x) = 0 (mod 5) and reducing the 
coefficients (mod 5), we see that x is a solution of the above congruence if and only 
if x is a solution of 


(1) 4x* + 3x3+x?+x+3=0 (mod 5) 


In this particular congruence, x = 0 (mod 5) is obviously not a solution, so that 
any solution must be relatively prime to 5. Thus, we can apply the Euler-Fermat 
theorem, obtaining x* = 1 (mod 5) for each solution x. If we substitute this into 
the congruence (1), we see that any solution of f(x) = 0 (mod 5) must also be a 
solution of 


(2) 3x3 + x?+x+2=0 (mod 5) 


Because x = 0 (mod 5) is not a solution of this last congruence, any solution of (2) 
must also be a solution of (1). Thus, f(x) = 0 (mod 5) and x3+x?7+x+22=0 
(mod 5) have the same solutions. 


For the rest of this section we will assume that the degree of the poly- 
nomial congruence f(x) = 0 (mod p) is n, where 0 <n <p. 

A problem of interest is that of determining when the integral polynomial 
f(x) is congruent (mod p) to a product of distinct linear terms. This is equiva- 
lent (see Theorem 9 of Chapter 3) to determining when f(x) = 0 (mod p) 
has n incongruent solutions (mod p). Before devising a test for this property, 
we make one further observation: If 


Ax) = a,x" + ay ax" bt +ayx +o, a, £0 (mod p) 
there is a polynomial 
g(x) = x" + b,x" * + + byx + do 

such that f(x) = 0 (mod p) and g(x) = 0 (mod p) have the same solutions. We 
obtain g(x) by multiplying f(x) by a solution of 

a,y = | (mod p) 
and reducing the coefficients (mod p). 
=» THEOREM 6/ Let p be a prime number and let f(x) be an integral 
polynomial of degree n, 0<n<p, which has a leading coefficient of 1. 
Divide x? — x by f(x) obtaining a quotient g(x) and a remainder r(x), where 
the degree of r(x) is less than n. The congruence f(x) = 0 (mod p) has n 


incongruent solutions (mod p) if, and only if, each coefficient of r(x) 1s 
divisible by p. 


PROOF: We write x? — x = f(x): q(x) + r(x). Since the leading coefficient 
of f(x) is 1, then q(x) and r(x) have integral coefficients. 


4.5 | APPLICATION TO HIGHER ORDER CONGRUENCES a 67 


Part I. Suppose f(x) = 0 (mod p) has nv incongruent solutions. Because 
r(x) is congruent (mod p) to 


x? — x — f(x) q(x) 


and (by Fermat’s theorem) any solution of f(x) = 0 (mod p) is also a solution 
of x? — x =0 (mod p), then r(x) = 0 (mod p) has at least n incongruent 
solutions. From the Theorem of Lagrange (Theorem 10 of Chapter 3), it 
follows that each coefficient of r(x) is divisible by p. 

Part IT. Suppose each coefficient of r(x) is divisible by p. Then r(x) = 0 
(mod p) and x? — x = 0 (mod p) have p incongruent solutions. Thus, 


F(x) + q(x) = x? — x — r(x) = 0 (mod p) 


also has p incongruent solutions. Since f(x) - g(x) has degree p and since any 
solution of f(x)g(x) = 0 (mod p) must be a solution of f(x) = 0 or g(x) =0 
(mod p), it follows that f(x) = 0 (mod p) has 7 incongruent solutions and 
q(x) = 0 (mod p) has p — n incongruent solutions (mod p). 


Example 2. 


Determine whether the congruences 
(a) x*+ 2x3 + 2x?+%x+2 = 0 (mod 5) 
and (b) 2x* + 3x3 — 3x? + 8x = 0 (mod 5) 


have four incongruent solutions. 


SOLUTION: (a) Dividing x*° — x by x* + 2x? + 2x? + x +2 we obtain 
x5 — x = (x4 4+ 2x2 4+ 2x? +44 2)°- (x —2)4 2x3 4+ 3x? —x4+ 4) 


Since the coefficients of the remainder are not divisible by 5, the congruence does 
not have four incongruent solutions. 
(b) Multiplying 2x* + 3x* — 3x* + 8x by 3, we see that 


x*-+ 4x3 + x?+ 4x = 0 (mod 5) 
has the same solutions as (b). Since 
xi — x = (x+ + 4x3 + x? + 4x)(x — 4) + 5x3 + 15x) 
congruence (b) has four incongruent solutions. 
=m COROLLARY 6.1 / Let p be a prime number and let 7 be a positive 
divisor of p — 1. The congruence 
x" — 1 =0 (mod p) 


has exactly n incongruent solutions. 
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PROOF: It follows from the algebraic identity 
RPS (AP Pt 1 I Oe PE ee oh x 


that the remainder is zero when x? — x is divided by x” — 1. 


EXERCISES 


1. Use the Euler-Fermat theorem to reduce f(x) = 0 (mod p) to give g(x) = 0 
(mod p) where g(x) has degree less than p — 1. Shown that g(x) = 0 (mod p) 
does not have the same solution sets as f(x) = 0 (mod p). 

(a) f(x) =x? + 2x* + 3x9 — 2x? -—x,p=5; 

(b) f(x) = 3x® — x9 + 3xt 4x24 x7? -—x4+2,p=S. 


2. Use Theorem 6 to determine if the following polynomial congruences 
have n incongruent solutions. 

(a) x* — 2x? + x?-—3x + 1 =0 (mod 7), n = 4. 

(b) 5x* — x9 + 4x? + 4x + 3 =0 (mod 11), n =4. 


3. Show that the congruence x? + a=0O (mod p) has only the solution 
= —a (mod p). 

4. Suppose f(x) = a,x? + a,_ 4x? ~* +++» +.a,x +a) where a) $ 0 (mod p) 

and a, —-, +a) =0 (mod p). Prove that f(x) = 0 (mod p) has at most p — 3 

incongruent solutions. 


46/AN IMPORTANT THEOREM 


A second important property of Euler’s totient function will be estab- 
lished in Theorem 7. Before proving this theorem we introduce a new 
symbol. 


a» NOTATION / If 7 is a positive integer and « is a function defined on the 
positive integers, the symbol “‘)' «(d)”’ means the sum of all of the numbers 


d|n 
a(d) as d ranges over the positive divisors of 7. 
For example, if « = @ and n = 10, then 
>, o(4) = o(1) + g(2) + (5) + p(10) =14+1+44+4=10 
d\10 


If « is defined by «(m) = m and n = 15, then 
Y ad) = Yd=14+345415=24 
d\15 


dj15 
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One observation will be made at this time: 


 a(a) =, o(5) 


d|n d|n 


This is obvious if we notice that as d ranges over all divisors of n, so does n/d. 


=» THEOREM 7/ If nis a positive integer, then )) p(d) =n. 
d|n 


We will prove the theorem by partitioning the set {1, 2, ---, 2} into non- 
overlapping subsets, each of which contains @(d) elements, where d is one 
of the divisors of n. The reader will probably have a better understanding of 
the proof if he verifies it for a particular example (see Exercise 1). 


PROOF: Let d,, d,, ---, d, be the positive divisors of n. If k is a positive 
integer, then the greatest common divisor of k and n is one of the d,. For 
fixed d, we will determine how many integers k from the set {1, 2, ---, m} have 
d,; as greatest common divisor with n. 

Suppose (k,n) =d;. We can write k =k’d,, n =n'd,, where (k’,n') = 1. 
Since 1 < k <n, then 1 < k’ <n’. Thus, k’ is a number in the set {1, 2, ---, n’} 
which is relatively prime to n’. Since there are p(n’) = o(n/d;) such numbers, 
then there are p(n/d,) values of k (since k = k’ - d,) in the set {1, 2, ---,”} which 
have d; as the greatest common divisor with n. 

Since each of the 7 elements in the original set is in one of these subsets, 


Using the remark preceding the statement of the theorem, we obtain 
n 
» oa) =)' 9 *) i 
d|n d|n 


EXERCISE 


1. Verify the proof of Theorem 7 for the case n = 65. 


4.7 | THE EXPONENT TO WHICH a BELONGS (mod m) 


Let m be a positive integer and let a be relatively prime to m. We know 
that w(m) is a solution of the congruence a* = 1 (mod m). In certain problems 
it is necessary to know the smallest positive solution of this congruence. 
For example, this number plays a fundamental role in the theory of decimal 
expansions of rational numbers. 
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m DEFINITION / We say that a belongs to the exponent k (mod m) if k is 
the smallest positive integer x such that a* = 1 (mod m). 


Example I. 


If we consider the powers of 5 (mod 7), we obtain 


5! = 5 (mod 7) 
5? = 4 (mod 7) 
53 = 6 (mod 7) 
5* = 2 (mod 7) 
5° = 3 (mod 7) 

© = 1 (mod 7) 


Thus, 5 belongs to the exponent 6 (mod 7). 


The reader should observe that the powers of 5 form a reduced system 
of residues (mod 7). 


Instead of considering all of the powers of the integer a, it is convenient 
to restrict the exponents that need to be considered by using the following 
theorem and its corollary. 


=» THEOREM 8/ If a’ =1 (mod m) and a belongs to the exponent k 
(mod m), then k|t. 
PROOF: We divide ¢ by k, obtaining 
t=kq+r O<r<k 
Then 
t=a=a"t*"=(a*)a" = 14: a" = a" (mod m) 
Since r is less than k, the smallest positive solution of a* = 1 (mod m), then 
r must be zero. Thus, t =k-q, and so kt. 


=» COROLLARY 8.1 / If a belongs to the exponent k (mod m), then k|g(m). 


Example 2. 


Find the exponent k to which 19 belongs (mod 47). 
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SOLUTION: Since 9(47) = 46, &k must be one of the numbers 1, 2, 23, 46. Com- 
puting the powers of 19, we find 


19? = 361 = 32 (mod 47) 
19* = 1024 = 37 (mod 47) 
19® = 1369= 6 (mod 47) 
19°*= 36 (mod 47) 


Thus, 197? = 19'!©- 194- 192-19 = 36-37-32:-19= —1 (mod 47), and 
194° = (— 1)? = 1 (mod 47), so that 19 belongs to the exponent 46 (mod 47). 


When finding the exponent to which a belongs for a composite modulus 
it is usually easiest to use the following theorem. The proof is left for the 
reader (Exercises 1 and 2). 


=» THEOREM 9/ If a belongs to the exponent s (mod m) and to the ex- 
ponent ¢ (mod n), then a belongs to the exponent [s, ¢] (mod [m, n)]). 


Example 3. 
Find the exponent to which 19 belongs (mod 7 : 47). 


SOLUTION: Since 19 = 5 (mod 7) then from Example 1 it follows that 19 belongs 
to the exponent 6 (mod 7). We found in Example 2 that 19 belongs to the exponent 
46 (mod 47). From Theorem 9 it follows that 19 belongs to the exponent [6, 46] =138 
(mod 7 : 47). 


EXERCISES 


1. Prove Theorem 9. 


2. Extend the result of Theorem 9 by induction; that 1s, prove: if a belongs 
to the exponents k,, k,, ---, k, modulo m,, m,, ---, m,, respectively, then a 
belongs to the exponent [k,, k,, ---, k,] (mod [m,, my, ---, m,]). 


3. Find the exponent to which 10 belongs (mod 37 - 7-11: 13). 


PROBLEMS 


1. Let p be a prime number. If a belongs to the exponent k (mod p), prove 
that a belongs to either the exponent k or the exponent kp (mod p7’). 
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2. Let p be a prime number. If a belongs to the exponent k (mod p"), where 
n> 1, and a does not belong to the exponent k (mod p"**), then a belongs 
to the exponent kp (mod p”* '), to the exponent kp? (mod p” * 2), ---, to the 
exponent kp’ (mod p" */), ---. The restriction n > 1 is only needed when p = 2. 


48 / PRIMITIVE ROOTS 


= DEFINITION / The integer a is called a primitive root (mod m) if a 
belongs to the exponent g(m) (mod m). 


For example, in Section 4.7, we established that 19 is a primitive root 
(mod 7) and (mod 47). 

Primitive roots play a key role in many theoretical investigations. For 
example, we will use the fact several times in Chapter 9 that primitive roots 
exist if m is a prime. In the appendix to this chapter we will use primitive 
roots to define functions similar to the logarithm function which can be used 
to reduce certain nonlinear congruences to equivalent linear congruences. 

In this section we will prove that primitive roots exist if the modulus m is 
a prime. Our method will be to determine the number of primitive roots for 
this case. 


m LEMMA / Let p be a prime number and let d be a divisor of p — 1. The 
number of incongruent numbers belonging to the exponent d (mod p) is 
either zero or o(d). 


PROOF: Let a belong to the exponent d (mod p). The numbers a, a’, ---, a4 


are incongruent (mod p) and each is a solution of 
x4 — 1 =0 (mod p) (4.1) 


It follows from Lagrange’s theorem (Theorem 10 of Chapter 3) that these 
are the only solutions of (4.1). Thus, any number belonging to the exponent 
d (mod p) must be congruent (mod p) to one of these numbers. Obviously, 
the number a* belongs to the exponent d if, and only if, k is relatively prime 
to d. Thus, there are exactly y(d) numbers in the above set belonging to the 
exponent d (mod p). 


= THEOREM 10/ If p is a prime number and d Is a positive divisor of 
p — 1, there are exactly o(d) incongruent numbers belonging to the exponent 
d (mod p). 
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PROOF: For each integer d, 1 <d<p-—1, let a(d) be the number of 
integers k, 1 < k < p—1, which belong to the exponent d (mod p). From 
Theorem 8 we obtain the fact that 


a(d) = 0 if dy)(p — 1) 
from the lemma, that 
0 < a(d) < gd) if d|(p — 1) 
and from Theorem 7 that 


» 9@)=p-i 
d\(p — 1) 
Because each of the p — 1 integers in the range 1 < k < p —1 belongs to 
some exponent (mod p), then 

p—1=o(1)+0(2)+--+a(p-1)= Yo ad)< Yi od)=p-1 

d|(p — 1) d|(p — 1) 

Thus, «(d) = g(d); the number of integers belonging to d (mod p) is equal 
to p(d), if d|(p — 1). 


=» COROLLARY 10.1 / If p is a prime number, there are exactly o(p — 1) 
incongruent primitive roots (mod p). 


Theorem 10 is surprising to many students. A natural conjecture is that 
a(d) is a function of both d and p. However, Theorem 10 informs us that 
the value of «(d) depends only on d (providing d divides p — 1). For example, 
since 7 divides p — 1 when p = 29, 43, 71, 113, and 127, then for each of 
these primes there are exactly six incongruent numbers belonging to the 
exponent 7. 

Primitive roots are important in number theory primarily because of the 
following theorem. The proof is left for the reader. 


=» THEOREM 11/ Let g be a primitive root (mod p). The numbers 
g,g’, +, g" * form a reduced system of residues (mod p). 


EXERCISES 


1. Find all primitive roots (mod m) for m =7 and m = 14. 
2. How many primitive roots exist (mod 28)? 


3. Prove Theorem 11. 
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4.9* | EXISTENCE OF PRIMITIVE ROOTS (mod m) 


In this section we will prove that certain moduli have primitive roots and 
all others do not. The results are summarized in Corollary 13.1. 


m DEFINITION / The positive integer ¢ is said to be a universal exponent 
for the modulus m if 


a’ = 1 (mod m) 
for all a relatively prime to m. 


For example, the Euler-Fermat theorem states that g(m) is a universal 
exponent for m. 


m DEFINITION / The function J is defined as follows: 


(1) Am) = o(m) if m = 2, m = 4, m = p*, or m =2- p", where p is an 
odd prime, 

(2) A(2*) = 74 92’) if « = 3, 
and (3) A(2*p,%! --> p,**) = [A(2”), A(p1"), °°, ACp2*)] if « = 2 or k = 2 and 
the p; are distinct odd primes. 


Example. 


Since 4400 = 2% - 57-11, then 
d(4400) = [A(2*), AGS), AUD] = [4¢@2%), 957), pA) 
= [4, 20, 10] = 20 


m THEOREM 12/ A(m) is a universal exponent for m. 


PROOF: (Part I) If m =2, m =4, m =p’, or m = 2p’, the proposition to 
be proved is identical to the Euler-Fermat theorem. 

(Part II) Suppose m = 2* (a = 3). Let a be relatively prime to 2*. Then a is 
odd and so we can write 


a=4n+1 


The proposition will be proved by induction on «a. 
If « = 3, we obtain 


a8) — q* = 16n? + 8n + 1 = 1 (mod 8) 


Assume that a? =1(mod2") fork >3 
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We rewrite this congruence as the equation 
Gres 145+ 2* 
and square, obtaining 
a? 42 = 1+ 2b-2* + b? - 27* = 1 (mod 2* * *) 


Since A(2** *) = 74 o(2* * 1) = 14 - 29(2") = 2+ A(24) 
then a2**") = 1 (mod 2* + +) 


From the principle of mathematical induction, it follows that 
qh?” = 1 (mod 2), “= 3, 4, — 


(Part III) Suppose m = 2%p,“' --- p,’* where either « > 2 or k > 2. If a is 
relatively prime to m, a is relatively prime to each of the moduli 2%, p,”', ---, p,"*. 
From the previous parts of the proof, it follows that 


a*(") = 1 (mod 27) 


a*P") = 1 (mod p,”') 


qrlpr**) =| (mod D2) 


Since A(m) = [A(2"), A(p1""), +++, A(pe"*)], then 
a*™ = 1 (mod 2”) 


a*™ = 1 (mod p,%) 


a) = 1 (mod p,”*) 


Since a“) — 1 is acommon multiple of the moduli 2’, p,", ---, p,*, it follows 
that the least common multiple of these numbers divides a**” — 1. Therefore, 


a = 1 (mod 2*p,* «++ py") 


Theorem 12 is frequently useful when computing the exponent to which a 
belongs (mod m). For example, since 4(4400) = 20, 3 must belong to one of 
the exponents 1, 2, 4, 5, 10, 20 (mod 4400). 


=m COROLLARY 12.1 /If m=2*% («@=3) or m=2%p,"'--- p,’*, where 
a >2 ork = 2, there are no primitive roots (mod m). 


SKETCH OF THE PROOF: We must show that in these cases A(m) < g(m). 
The details are left for the reader (Problem 1). 
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Thus, we can have primitive roots (mod m) only in the following cases: 
m=2,m=4, m = p*, or m = 2p’, where p is an odd prime. In Theorem 13 
we will prove that primitive roots exist for all of these cases. In order to 
simplify the proof we prove two lemmas. 


= LEMMA 1 / Let p be an odd prime and let k be a positive integer. If g 
is a primitive root (mod p*), then g belongs to the exponent (p*) or to the 
exponent o(p* * ') (mod p**?*). 


PROOF: Suppose g belongs to the exponent n (mod p* *'). By Theorem 8 
we know that n|p(p* * 1). Since g belongs to the exponent g(p") (mod p*) 
and g" = 1 (mod p*), it follows that o(p*)|n. Since p(p* * 1) = pe(p*), n must 
be either g(p") or p(p* *'). 


= LEMMA 2/ Let p be an odd prime. There exists an odd integer that is a 
primitive root (mod p) and (mod p’). 


PROOF: Let g be a primitive root (mod p). We may assume that g is odd; 
otherwise, g + p is an odd primitive root and we may use it instead of g. 
If g?) # 1 (mod p”), then by Lemma 1, g is a primitive root (mod p’). 

If g?) = 1 (mod p”) we will consider the integer g + 2p, which is also 
an odd primitive root (mod p). Using the binomial theorem, we obtain 


(g+ 2p)? = g?'?) as ("| gem as 12D 
re ("ae = (2p)? eae 2 (2p)??? 


= 9%) 4 ee g?) ~ 12 (mod p?) 
= 1+ (p— 1g? ~ *2p (mod p’*) 
= 1 — 2pg)~ * (mod p”) 
Since 2g?) — 1 is relatively prime to p, then 
(g + 2p)? # 1 (mod p’) 


From Lemma | it follows that g + 2p is a primitive root (mod p) and 
(mod p”). 


a THEOREM 13 / Let p be an odd prime number. If m = 2, m = 4, m = p*, 
or m = 2p", then primitive roots exist, modulo m. 


PROOF: (Part I) If m = 2 or m = 4, we can verify from the definition that 3 
is a primitive root (mod m). 
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(Part II) Suppose m = p* where p is an odd prime. By Lemma 2 there 
exists an odd integer g that is a primitive root (mod p) and (mod p”). We 
make the inductive assumption that g is a primitive root (mod p*) for 


a=1, 2, ---, k (k = 2). We will now prove that g is a primitive root (mod 
k+1 
pe’ *). 


Suppose g is not a primitive root (mod p**'). From Lemma 1 it follows 
that 


g??) = 1 (mod p***) (4.2) 
Because g is a primitive root (mod p*~') and (mod p*), it follows from 
Lemma | that we can write 
ger) = 1 4+ mp k- m #0 (mod p) (4.3) 
If we raise both sides of this equation to the power p, we obtain 
g? 9") = (1 + mp*~ 1)? =1+ mp" (mod p***) 
Since po(p* ~ ') = g(p*), then from (4.2), 
go) =] = gr N=] 4 mp* (mod p* + 1) 


Thus, m=O (mod p), contradicting Equation (4.3). Because our initial 
assumption was false, then g??") #1 (mod p**!'), which, by Lemma 1, 
implies that g is a primitive root (mod p* **). : 

By the principle of mathematical induction, it follows that the odd integer 
g is a primitive root (mod p*) (a = 1, 2, -:-). 

(Part IIT) Suppose m = p*. Let g be an odd primitive root (mod p*). Then 
g is also a primitive root (mod 2). By Theorem 9, g belongs to the exponent 
[1, p(p*)] (mod [2, p*]). Since p(2p*) = e(p*), then g belongs to the exponent 


~(2p*) (mod 2p"). 


The results of Corollary 12.1 and Theorem 13 are summarized in the 
following corollary: 


=» COROLLARY 13.1 / Let m be an integer greater than 1. Primitive roots 
exist (mod m) if, and only if, 


where p is an odd prime number. 


PROBLEM 


1. Prove Corollary 12.1. 
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APPENDIX / THE THEORY OF INDICES 


In Theorem 11 we established that if p is a prime number and g is a primitive 
root (mod p), the numbers 


9; 9g’, mee, g? 1 


constitute a reduced system of residues (mod p). Thus, if a is relatively prime 
to p, there is a unique integer k, 1 < k < p — 1, such that g* = a (mod p). 


m DEFINITION / Let g be a primitive root, modulo the prime p, and let a 
be relatively prime to p. The smallest positive integer k, such that g* =a 
(mod p), is called the index of a (with respect to the primitive root g for the 
prime p), and is denoted by the symbols 


k =ind, a =ind a 
Example I. 
Construct a table of indices for the prime 13. 


SOLUTION: We base our table on the primitive root 2. The powers of 2 (mod 13) 
are 


Thus, our table is 


n 1 2 3 4 5 6 7 8 9 10 11 =612 
ind 7” 12 1 4 Z 9 5 11 3 8 10 7 6 
The next theorem shows that indices have properties similar to those of 


logarithms. We will first prove a useful lemma that establishes a corre- 
spondence between numbers (mod p) and indices (mod p — 1). 


m LEMMA / Let g bea primitive root, modulo the prime p, and let a be rela- 
tively prime to p. Then g‘ = a (mod p) if, and only if, t = ind, a (mod p — 1). 
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PROOF: If ¢ = ind, a (mod p — 1), we can write 


t=inda+m:(p-—1) 
Thus, 
g’ = gies + m(p — 1) = gint4(gP - ve = gare = q (mod p) 


On the other hand, if g' = a = gi"** (mod p), then 
g' — inda| =| (mod p) 


Since g belongs to the exponent p — 1 (mod p), it follows from Theorem 8 
that 


t = ind a (mod p — 1) 


=» THEOREM 14/ If 7 is a positive integer and a and b are not divisible 
by the prime number p, then 

(a) ind ab = ind a + ind b (mod p — 1) 
and (b) ind a” =n ind a (mod p — 1). 


The proofs are similar to the corresponding proofs for logarithms and are 
left for the reader (Exercises 2 and 3). 


Example 2. 


Solve 5x'?° = 7 (mod 13). 


SOLUTION: Using Theorem 14 and our table of indices for the prime 13, we 
obtain 
5x'!° = 7 (mod 13) 


ind (5x!°) = ind 7 
ind 5+ 10 ind x = ind 7 (mod 12) 
10 ind x = ind 7 — ind 5 
11 —9 =2 (mod 12) 


5 ind x = 1 (mod 6) (cancellation law) 
ind x= 5 (mod 6) 


Thus, ind x = 5 or ind x = 11. Using our table again we find the solutions of the 
original congruence are 
x = 6 (mod 13) 


or x = 7 (mod 13) 
Compared to logarithms, indices have two defects: (1) Tables must be 


constructed for each prime modulus [and there are g(p — 1) possible tables 
for each prime p]; (2) the entries in the tables are not in ascending order as is 
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the case with logarithms. Thus, it is convenient to have two sets of tables, 
one to convert indices to numbers and one to convert numbers to indices. 
Such tables of indices for the primes less than 100 are found at the end of 
the book. 


Example 3. 


Prove that the congruence 23 x* = 17 (mod 71) has no solution. 


SOLUTION: ind (23 x°) = ind 17 
5 ind x = ind 17 — ind 23 = 49 — 15 = 34 (mod 70) 


Because the greatest common divisor of 5 and 70 does not divide 34, this congruence 
has no solution. 


Example 4. 
Solve 23 - 5* = 33 (mod 71). 


SOLUTION: The above congruence is equivalent to 
x ind 5 = ind 33 — ind 23 (mod 70) 
28 x = 57—15=42 (mod 70) 
2x = 3 (mod 5) 
x=4 (mod 5) 
Tables of indices can be used to determine the exponent to which an 
integer belongs (mod p) and thus can be used to find all primitive roots 


(mod p). The method is given in the statement of the following theorem. 
The proof of the theorem is left for the reader (Problem 1). 


=» THEOREM 15/ If a is not divisible by the prime p, then a belongs to 
the exponent 


~(P) 
(~(p), ind a) 


(mod p). 


m COROLLARY 15.1 / The integer a is a primitive root (mod p) if, and 
only if, 


(p(p), ind a) = 1 
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Example 5. 
Calculate the exponent to which 10 belongs (mod 89). 


SOLUTION. From the tables we find that ind 10 = 86. Thus, (¢(89), ind 10) = (88, 
86) = 2 so that 10 belongs to the exponent 


7(89) 


44 = (5(89), ind 10) 


(mod 89) 


EXERCISES 


1. Find all solutions of the following congruences: 
(a) 14x72 = 81 (mod 97). 
(b) 14x’* = 83 (mod 97). 
(c) 8+ 50* = 41 (mod 59). 


2. Prove Part (a) of Theorem 14. 
3. Prove Part (6) of Theorem 14. 
4. Prove that ind, g = | and ind, 1 =p — 1. 


PROBLEMS 


1. Prove Theorem 15. 


2. State necessary and sufficient conditions (in terms of indices) which insure 
that the congruence 


ax’=c (mod p) 


has a solution. Determine the number of incongruent solutions. Prove your 
result. 


DECIMAL EXPANSION 
OF RATIONAL 
NUMBERS 


5.1 / INTRODUCTION 


One of the most surprising facts about the history of mathematics is the 
slowness with which important concepts were developed. The ancient 
societies experimented with number systems for centuries before the Hindus 
invented the positional notation. To us, having great hindsight, the next 
logical step is the use of the positional notation in writing fractions. 

Although the Hindu notation was introduced in Europe in the early 
thirteenth century, it was not until the end of the sixteenth century that decimal 
fractions began to be adopted and not until the beginning of the eighteenth 
century that the usage was universal in Europe. 

Actually, the decimal notation had been anticipated over the centuries. The 
ancient Babylonions had a positional notation based on the number 60 which 
they could use to represent fractions.’ In the fourteenth century, Oresme, 
Bishop of Normandy, invented a decimal fraction system which was never 
adopted, and in the middle of the fifteenth century Regiomontanus (Christian 
name: John Muller) of Germany used a decimal-like notation in trigono- 
metry. In 1530 Christoff Rudolff of Vienna, in a book of calculating examples, 
used a decimal notation similar to the one now used. 

Simon Stevin, a Belgian of many scientific interests, was the major force 
behind the adoption of decimal fractions. In La Disme, published in 1585, he 
introduced a cumbersome decimal notation and systematically developed the 
properties of decimal fractions. The book included an essay in which Stevin 
strongly urged the adoption of the decimal system, pointing out the advantages 
for bookkeeping, monetary units, and systems of weights and measures. 

The present simplified decimal notation was first used in 1617 by the Scot- 
tish mathematician Napier, the inventor of logarithms. 


1The sexagesimal system was used in Europe until the Middle Ages for the 
extraction of roots. 
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Let us review the basic principles of decimal notation. When we write 
“0=N.a,a,Qa3°°: 
where n is a nonnegative integer and a,, a>, --- are integers in the range 
0 < a; < 10, we mean that the series 
Ga, _ a2 a3 
es + — + eee 
10 107 10° 


converges to the real number a. The proof that any nonnegative real number « 
can be represented in this manner is sketched in the problem section. 

We could, of course, use a base other than 10 for our number system. A 
practical method of converting from base 10 to a different base is illustrated by 
the following example. 


Example. 
Convert 0.419 to base 3 notation 


SOLUTION: We write (using base 10 notation) 


ay, a2 a3 
Oe tia has 0<a,<3 
a2 a3 
Thus, 30.4) =1.2=a+ 3+ 5+ 


implying that 
a2 a3 
a,=1 and O2=Z +324 -~ 
Multiplying the latter expression by 3, we obtain 
a3 a4 
3(0.2) = 0.6 = A2 + 3 +- 32 + Seis 
so that 


a,=0 and 0.6=—-+ = 
Continuing in this manner, we obtain 


a4 as 
3(0.6) =1.8=a; Tat 3a t cee 


aa as 
az=1 and 0.8 = 3 +52 + eee 
as a6 
and 3(0.8) =2.4=a+>+ 35+ eee 
As aé 
so that a=2 and 04=—-4+—54+-- 
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Comparing the form of the two series for 0.4, we see that if we continue, we will 
obtain 


a, =as=—ag = 413=°:: = 1 
a2 =Ag = A109 = a4 =: =0 
a3 = a7 —-ai1=as5=:: = 1 
a4 = Ag = 412 = a16 =: =2 
Thus, 0.410 = (0.101210121012 ---)3 


In actual practice, rather than writing out each of the above steps we use the 
following scheme: 


In the remainder of this chapter we will study the decimal expansions of 
rational numbers. We will show that the expansions are periodic, establish 
necessary and sufficient conditions which insure the expansion to be finite, 
and, finally, determine the period and the position in which the repeating 
block begins. 

It is obvious that if k, m, n, and t are positive integers, then 

DS ee ane OS 
n n in 


all have the same decimal part in their expansions. Thus, we may restrict 
ourselves to studying the expansions of proper fractions that are reduced to 
lowest terms; that is, we will consider the expansion of m/n, where 1 < m<n 
and (m, n) = 1. 


EXERCISE 


1. Convert the following fractions to base g notation: 
(a) (2%)10.9 = 12. 
(b) 0.132,5,g9 =7. 
(c) 41.132,5,g =7. 
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PROBLEMS 


1. Let m be a nonnegative integer and a,, a2, a3, --- be integers in the range 
0 < a; < 10. Prove that the series 


ji 2 eg 
10 10? 10° 


converges to a real number a. 
2. Let « be a positive real number. Prove that there is a series 


p22 ga: Heaei0 


that converges to a. [Hint: Let n =[a]. Define the sequences {a;}7_ , and 
{a;};°_- , inductively as follows: 


a, =a—Nn, a, =[10a,] 


X> = 10a, — Qi; a> = [10«, ] 


a, = 10%, ; -— a -1, ay, = [100,] 


3. Devise a rigorous proof that the method illustrated in the Example can 
always be used to convert fractions from base 10 to base g notation. 


5.2 / FINITE DECIMAL EXPANSIONS 


In this section we will establish necessary and sufficient conditions which 
insure that the decimal expansion of m/n is finite. 


= THEOREM 1 / Let mand zn be relatively prime, 1 < m <n. The decimal 
expansion of m/n is finite if, and only if, n is of form 2%5° (where « and f are 
nonnegative integers). If this is the case, the length of the expansion is equal to 


the larger of the numbers « and f. 
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PROOF: (Part I) If m/n has a finite decimal expansion, we can write 


m a, a a, 10°~ 4a, +10*~ 7a, 4+-+a 
— ay ye tp = 
n 10 10 10 10 
so that 
10°m = n(10*~* a, + 108-7 4+ --» +a) 
Thus, 


njl0'm and = (m,n)=1 


implying that n|10*. It follows from the unique factorization theorem that 
n = 2°5°, where « and B are nonnegative integers, not both zero, and each less 
than or equal to k. 

(Part II) Suppose n = 2*5°. For sake of argument, we will assume that B > «. 
Since n > 1, then Bf > 0. Thus, 


m_ 2°" *m 

n 10° 
We now write the integer 2° ~ “m in the Hindu-Arabic notation 
2 - 4m = ¢,10'+c,_ ,10°~ 1 + + +c410 + cy 


where each c; is one of the ten digits. Since 542° ~ %m, then 5} co, and so 
Co # 0. Furthermore, since 1 < m <n, it follows that B > t. Thus, 


m 2m «G 4 on Cy Co 
n 108 108-* 108 


Since cy # 0, the fth position in the decimal expansion is the last nonzero 
position. Thus, the expansion 1s finite, of length f. 

If we had assumed a = f, we would have established in a similar way that the 
expansion is of length a. 


5.3 / INFINITE DECIMAL EXPANSIONS 


In this section we begin our investigation of infinite decimal expansions. 


m DEFINITION / The decimal expansion a,/10 + a,/10? + a3/10° + --- is 
said to be periodic if there exist positive integers k and t such that 


QA = an+t =4,+21 ee 


Qe41 ~S&4141 ~UN+2t41 =°°° 


Q+1-1 ~ 421-1 =~ UN 43r-1 7°" 
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The smallest such ¢ is called the period of the expansion. If k = 1, the expan- 
sion is said to be purely periodic. 


Example. 


The expansion of 14 = 0.33333 --- is purely periodic with period 1. The expansion 
of 13/70 = 0.1857142857142 --- is periodic with period 6, but is not purely periodic. 


In order to simplify the notation, it is customary to write the repeating 
block of a periodic expansion with a bar over it rather than repeating it several 
times. Thus, we would write 


13 —————— 
— = 0.1857142 
70 
47 — 
and — = 0.267045 
176 


If the expansion of m/n is to be infinite, then n is not of form 2% 5°. For 
purposes of our notation, we let m, =m. 
Thus, we can write 


so that 
10m, =ayn+ (7 a5 a + )n 


Since 10m, and a,n are integers, then (a,/10 + a3/107 + ---)n is also an 
integer. Call it m,. Because the expansion is not finite, then 


a2 a3 


0O<—+-—54+:::<1 
10° 107 * 
and so 
az , 43 
0 Fone (pS Rs Ea 
<M), (B+ 34 Jn<n 
Thus, we have 
10m, =a,n+m, O<m,<n 


That is, a, is the quotient when 10m, is divided by n, and m, is the remainder. 
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We now repeat the process, obtaining 


10m, =a,n+m, O<m,<n 
a (S3 4. Oe 52. 
where mM; = (734 i02 + )n 


Eventually we obtain the chain of equations 


10m, =a,;n+m, O<m,<n 


10m, =a,n+m, O<m;<n 
10m, — a3n + MN, 0 < ms< n (5.1) 
10m, = a,n + my + 4 O< msi, <Nn 
Qk+1 , +2 
where m = fe aA 
ae (33 10? 


The reader should note that the above equations represent the steps in the 
familiar process of long division. For example, if m = 4 and n = 37, then in 
the long division process we obtain 


0.1081 --- 


37| 4.0000 --- 
37 


30 
0 


300 


Thus, in the chain of equations we have 
10m, = 40 =1-37+3 
10m, = 30 =0- 37+ 30 
10m, = 300 = 8 - 374+ 4 
10m, = 40=1-374+3 
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= THEOREM 2/ Let (m, n) =1, 1 < m <7, and let t be a positive integer. 
If, in the chain of equations, m, = m, , , for some positive integer k, then the 
decimal expansion of m/n is periodic and there is a repeating block of length ¢, 
beginning with a,. 
PROOF: Because m, = Mm, + 3; 
10m, = ayn + mM, +1 
=10m, 4 5 =A 4 M+ Me 41 


where 0 << m4, <nand0<m, 4,4 ,<4n. From the uniqueness of division, 
it follows that 


aA, = A+: and M+ 1 = M4141 


Thus, 10m, +; =AQ,454N + M+ 2 
=10mM 41 = We rei t Maes 


which implies that 


G41 =r r+ and My +2 = M+ 142 


Continuing in this fashion (this statement disguises a proof by mathematical 
induction), we see that if j is a nonnegative integer, then 


M+ jp = Me tj 
and Ge+ jp = U+ try 


Applying this result for the nonnegative integers, j, j + ¢, 7 + 2t, ---, we obtain 


G+ jp ~QNree Gg FN ae4 57 H+ 345 FO 


By letting j take on the values 0, 1, 2, ---, t — 1, we find that 
a, = Apr t = ak + 2 = Ay + 3t 


Q+1 ~Q4+e41. =~ 4+ 2041 ~ 4+ 3t41 =°" 


Q+-1-1 ~— U4 20-1 ~~ 4 31-1 ~4+4e-1 =F" 


Thus, 


m dee eee area 
yO Ata Ae 1 AA + 1 " Ak+t—1 


EXERCISE 


1. Write out the chain of equations (5.1) for m = 5,n = 28. 
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5.4 / PERIODICITY OF THE EXPANSION OF m/n: 
SPECIAL CASE 


In this section we consider the expansion of m/n where 1 < m <n and nis 
relatively prime to 10m. We will show that the expansion is purely periodic in 
this case and will determine the period. 


= THEOREM 3/If1 <m<n and nis relatively prime to 10m, the decimal 
expansion of m/n is purely periodic. If 10 = 1 (mod n), there is a repeating 
block of length & that begins with the first position of the expansion. 


PROOF: Since (10m, n) = 1 then (10, 2) = 1. Thus, there is an integer k such 
that 10 = 1 (mod 7). If we replace each of the equations in the chain (5.1) by 
a congruence (mod 7), we obtain 


10m, = m, (mod n) 
10m, =m, (mod n) 


10m, = m, (mod n) (5.2) 
10m, = m, , , (mod n) 


By substituting successively in the chain of congruences, we have 
10m, =m, (mod n) 
107m, = m; (mod n) 


10°m, = m, (mod n) 


/ 


10*m, = m, +, (mod n) 
Thus, since 10* = 1 (mod n), 

m, = 10‘m, = m, +, (mod n) 
and, since 0 < m, <n,0<m, 4, <n, this implies that 
Mm, =™ +1 


From Theorem 2 it follows that the expansion is purely periodic and a 
repeating block of length k exists that begins in the first position. 


We will now prove a converse to part of Theorem 3. 


& 


5.4 | PERIODICITY OF THE EXPANSION OF m/n a W 
m THEOREM 4 / Suppose that 1 < m <n, and 10m is relatively prime to x. 
If there is a repeating block of length k in the expansion of m/n, then 10* = 1 
(mod 7). 


PROOF: Suppose the repeating block begins in the tth position. Then 


Gy | A+1 Ap+k-1 a 
m= (T+ te p Se De tn 


10 107 

Gp+k , UW+k+1 At+2k-1 A,+k 
= (ght gt +o + gat + agit) 
= M; + &k 


From the chain of congruences (5.2), we obtain 
10m, 


107m, 


m, + 1 (mod n) 


m, + 2 (mod n) 


10m, =m, +4, =m, (mod n) 
From the cancellation law for congruences it now follows that 


10* = 1 (mod n) 


Thus, we have proved that if 1 <m<n and (10m, n) = 1, the decimal 
expansion of m/n is purely periodic and k is the length of a repeating block in 
the expansion if, and only if, 10 = 1 (mod n). Since the smallest such integer k 
is the exponent to which 10 belongs (mod n), then this number is the period of 
the expansion. We summarize these results in Theorem 5. 


=m THEOREM 5/ Suppose 1 < m <n and that (10m, n) = 1. The decimal 
expansion of m/n is purely periodic and the period of the expansion is equal to 
the exponent to which 10 belongs (mod n). 


Example. 


4 
Calculate the period of the decimal expansion of 7-89 


SOLUTION: In Example 5 of the appendix to Chapter 4, we found that 10 belongs 
to the exponent 44 (mod 89). Using tables of indices (or direct computation) we 
find that 10 belongs to the exponent 6 (mod 7). From Theorem 9 of Chapter 4 it 
follows that 10 belongs to the exponent [6, 44] = 132 (mod 7 - 89). Thus, the period 


of the expansion of is 132. 
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EXERCISE 
1. Calculate the periods of the decimal expansions of: 
(a) — 
“13-17-89” 
5 
(0) 9-11-59-97 


5.5 / PERIODICITY OF THE EXPANSION OF m/n: 
GENERAL CASE 


We have considered two special cases of the decimal expansion of m/n. 
In Section 5.2 we considered the case where 7 is of form 2% 5°, and in Section 
5.4 the case where v is relatively prime to 10. Before considering the general 
case we prove a lemma. 


= LEMMA / Let m, n,, n, be integers, n, and n, both greater than 1. 
Suppose that 1 < m < n,n), that (m, n,n,) = 1, and that (”,, n,) =1. There 
exist integers m,, m2, and q, 
l<m;<n; and (m;,n;) = 1 (i = 1, 2) 
such that 
m 


Mm, M2 
=—+—+q 
NynNa Ny No 


PROOF: Since 1, and 7, are relatively prime, there exist integers x and y such 
that 
m= xn, + yn, (5.3) 
We now divide x by n,, obtaining 
X=Nq, +m, O<m, <n, (5.4) 
Similarly, we write 
Y=NQG,.+M2, O<m, <n, 
On substituting in (5.3), dividing by n,n,, and simplifying , we obtain 
mem Mm, 
= where = 
nin, my Ny + 4q q@=41+ 42 
It follows from (5.4) and (5.3) that 
(m,,1,)|m 


Thus, (m,, ,)|(m, 1,), which implies that m, and n, are relatively prime. 
Since n, = 2, this means that 


l<m,<n, and (m,,n,) = 1 
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A similar proof shows that 
] < My < Ny and (m,, Nn») = ] 


= THEOREM 6/ Suppose (m, n)=1 and 1<m<n. Write n = 2*5'n’ 
where (n’, 10) = 1. The decimal expansion of m/n is periodic with period equal 
to the exponent to which 10 belongs (mod n’). The repeating block begins in 
position ¢ + 1, where t = max {a, f}. 


PROOF: If « = 6 = 0 or ifn’ = 1, the hypothesis reduces to that of Theorem 
1 or Theorem 5 (special cases of this theorem). 

Assume that not both « and f are zero and that n’ > 1. By the Lemma there 
exist integers m’ and m” such that 


4 " 


m__m im 

2*58n’ on’ #58 
where qg is an integer, 1<m' <n’, 1<m" < 2*5°, (m',n')=1, and 
(m”, 2°58) = 1. Since g is an integer, the decimal expansion of m/2%5°n' is 
determined by the expansion of 


+4q 


, ”" 


mm 
n’ * 2858 

From Theorem 5 we know that the expansion of m'/n’ is purely periodic with 
period equal to the exponent to which 10 belongs (mod n’). Although we 
usually consider the repeating block to start in the first position, we may start 
it in any position; in particular, the (¢ + 1)st position. From Theorem 1, the 
expansion of m"/2%5° is finite of length t. If we now add the two fractions we 
see that the th term of the expansion of m’/n’ is changed (and possibly other 
terms as well), although no term after the tth term is changed. Thus, the ex- 
pansion of m/2*5’n’ is periodic, with the period equal to the exponent to 
which 10 belongs (mod n’), and the repeating block begins in the (¢ + 1)st 
position, where ¢t = max {a, f}. 


EXERCISES 


1. Find the periods and the positions in which the repeating blocks begin for 
the decimal expansions of: 


3 

(2) 5732-3’ 
23 

(6) 790-99. 59-97" 


2. Show that the integer g is necessary in the statement of the Lemma by 
considering the lemma for m = 1, n, = 5, n, = 7. In general, what values may 
g assume? 


PERFECT 
NUMBERS 


6.1 / INTRODUCTION 


From ancient times magic properties have been associated with certain 
numbers. Some of the more favored numbers were 7 and 40 (as evidenced 
from their use in the Bible), 6 (the number of days necessary for the creation of 
the world), and 28 (the length of the lunar cycle). 

The ancient Greeks made extensive classifications of numbers according to 
their magic properties. One such classification depended on the sums of the 
proper divisors of the numbers (the so-called aliquot parts). A number is 
perfect if itis equal to the sum of its proper divisors (such as 6, 28, and 496), 
deficient if the sum is less than the number, and abundant if the sum is greater 
than the number. Two numbers are amicable if the proper divisors of each 
sums to the other (such as 220 and 284). 

Perfect numbers, being between abundancy and deficiency, signified perfec- 
tion and were greatly desired. Amicable numbers played an important role in 
astrological calculations involving love potions, talismans, and so on, 
especially among the Arabs of a later millennium. 

In Europe, numerology was taken quite seriously until modern times. For 
example, Alcuin (eighth century) noted that because the second origin of man- 
kind was associated with the deficient number 8 (the number of souls on 
Noah’s Ark), while the first origin was associated with the perfect number 6, 
the second creation was inferior to the first. As late as the sixteenth century the 
mathematician Peter Bungus worked with great industry to assign the number 
666 (the symbol of the Antichrist in Revelation XIII, 18) to Martin Luther.* 
This attack on Luther was, to some extent, provoked, since Luther’s friend 
Stifel had previously assigned the same number to Pope Leo X. 


‘Bungus assigned numbers to the letters of the alphabet as follows: a =1, 
b=2,---,k =10, /= 20, ---. After misspelling Luther’s name he found that 
M30, Ai Rso Tioo Io Nao L20 V200 Tioo Es Rao Ai sums to 666. 
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It will be proved in this chapter that an even number is perfect if, and only 
if, it is of a certain form. A few results concerning odd perfect numbers will be 
mentioned. 


6.2 /THE SIGMA AND TAU FUNCTIONS 


To facilitate our study of perfect numbers we introduce two functions 
defined on the positive integers. In number theory it is customary to call these 
functions arithmetic functions or number-theoretic functions rather than 
sequences. 


m DEFINITION / If 7 is a positive integer, we let t(n) denote the number of 
positive divisors of n, and o(n) the sum of the positive divisors. 


Example. 


Since 10 has the positive divisors 1, 2, 5, and 10, then 7(10)=4 and o(10) 
=14+2+5410=18. 
Using the notation of Section 4.6, we could have defined these functions by 


(n= 1, ofs)= Sd 
d|n d|n 


In this section we will obtain formulas to express 7(m) and o(n) in terms of the 
prime divisor of x. 


= THEOREM 1 / The functions o and t are multiplicative. 


PROOF: We must show that if m and n are relatively prime, then o(mn) = 
a(m)o(n) and t(mn) = t(m)t(n). 

Let m,, M2, *+*, Mm) be the positive divisors of m, and ny, No, -*+, Nap») the 
positive divisors of n. Consider the set of products 


{m,N,, M2N1, M3N,, °°, Mim 


MN2, M2N2, M3N2, *°*, Mame (6.1) 


MyNny> M2" n)> MN3Nwny> °° "s Mm Nyny} 


Each number in this set is obviously a positive divisor of mn. On the other 
hand, if dis a positive divisor of mn, then (d, m) is one of the divisors of m, 
say m,, and (d, n) is one of the divisors of n, say n;. Since d|mn and (m, n) = 1, 
it follows from the unique factorization theorem that d= m,n,. Thus, each 
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positive divisor of mn is in this set. Since we will count the elements of (6.1) 
we must be sure that no number is duplicated in the set. Suppose 


m,n; = mn, 


Because m and n are relatively prime, then m, and n;, are also relatively prime. 
Therefore, since m,|m,n,, it follows that m,|m,. In a similar way, we see 
that m,|m;. Thus, m, = m,, so that h = 7. Similarly, we see that i = k. Thus, 
the numbers in (6.1) are all different. 

Because the numbers in the set (6.1) are the positive divisors of mn, then 
t(mn) is equal to the number of elements in the set and o(mzn) is the sum of the 
elements. Therefore, 

t(mn) = t(m)t(n) 
and 
a(mn) = myn, + myn, + M3Ny + + + My + 


+ MyNg + MzN2 + M3yN2 + + + Mame + 


+ MyNym) + M2Nyny + M3N ayy Ht FH Many ny 
= (m, +My, +e + Mm (Ny + No 4- oes Ne(ny) 
= a(m)a(n) 


Thus, o and t are multiplicative functions. 
We will now be able to evaluate the o and t functions if we know the values 
of o(p*) and t(p*) for each prime p. 


= THEOREM 2 / If pisa prime number, then 


ati_y 
o(p) == 
p-1 
and Up") =a+1 


SKETCH OF THE PROOF: The proof follows trivially from the fact that the 
positive divisors of p* are 


1, P, p’, ar) p 


We can combine the results of Theorem 1 and Theorem 2 as follows: 


=» COROLLARY 2.1 / If p,, p2, ---, p, are distinct prime numbers, then 


ay,+i a2+1 +1 
1 —]1 2 — |] ak — 
o(p,"'p2” és Fa oe en i Sinan. Pet caiecas 

pPi-il p2—1 Pe — 1 


and 
T(Dy"'P2? -*> Pe“) = (@, + IO, +1) (| + 1) 
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Example. 
Calculate o(n) and 7(n) if n = 1,274,000. 


SOLUTION: Since 1,274,000 = 2% - 7? - 5° - 13, then 
a(n) = o(2*)0(77)a(5*)a(13) 


2§'—1 7?—1 5*-1 13°-1 


—— eee © eee © amen © 


1 6 4 12 
= 31 -57-156- 14 = 3,859,128 
and 
a(n) = 7(2*)r(72)7(5?)7(13) 
=5-3-4-2=120 


EXERCISES 
1. Calculate o(n) and t(n): 
(a) n=76; 
(b) n = 1460. 


2. Prove: If m and 7 are relatively prime positive integers and d is a positive 
divisor of mn, then 


d =(m, d)  (n, d) 


3. Prove: t(n) is odd if, and only if, n is a square. 


6.3 /EVEN PERFECT NUMBERS 


We can reformulate the definition of a perfect number as follows: 
= DEFINITION / A positive integer x is called a perfect number if o(n) = 2n. 


Let us examine the form of the three perfect numbers mentioned in Section 
6.1: 


6 = 23 2(2? — 1) 
28 =4-7 22(23 — 1) 
496 = 16-31 = 24(25—1) 
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We see that each of these numbers is of form 
2k — 1¢2* — 1) 


where 2* — 1 isa prime. The following theorem, due to Euclid, shows that any 
number of this form is perfect. 


= THEOREM 3/ If 2* — 1 is a prime number, then 
2k 19 — 1) 


is a perfect number. 


SKETCH OF THE PROOF: Use Corollary 2.1 to show that 
o(2* ~ 1(2* — 1)) = 2*(2* — 1) 
The details are left for the reader. 
Two thousand years after Euclid, Euler was able to prove that all even 


perfect numbers ate of the type found by Euclid. Euler proved the following 
theorem: 


= THEOREM 4 / Ifn is an even perfect number, there exists a prime 
number (2* — 1) such that 

n= 2*~ 102k — 1) 

PROOF: Since n is even we can write n = 2'm, where m is an odd integer. 
Since vn is perfect, then 
a(n) = o(2'o(m) = (2'* — 1o(m) 
= 2n (6.2) 
ts 3! +1 m 


Thus, (2' + ! — 1)| 2‘ * !m, implying, since 2‘ * 1 — 1 is odd, that (2° + * — 1)| m. 
Therefore, 


m=(2't! — 1)u 


where u is an integer. Suppose uw is greater than 1. Then 1, u, and (2'*! — 1)-u 
are distinct divisors of m, so that 


o(m)>1+u+(2't!—lu>2't*!u 
Therefore, 
a(n) = (2° +? — 1)o(m) > (2° +1 — 12°F 4u = 2'* !m =2n 
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which contradicts (6.2). Thus, vu = 1 and 
m= whi +1 l 


If we assume that m is not a prime, we get a contradiction similar to the one 
above. Thus, 

m= 2'*!— lisa prime 
and n=2'(2' t+! — I) 


If we let k = ¢ + 1, we have the desired result. 


As we have seen, our search for even perfect numbers has been reduced to a 
search for prime numbers of form 2" — 1. Such primes are called Mersenne 
primes after Martin Mersenne (1588-1648). We will prove one rather trivial 
result about Mersenne primes. 


= THEOREM 5/ If 2* — 1 isa prime number, then k is also a prime. 


PROOF: If k is not a prime, we can write k = ab, where a and b are both 
greater than 1. Thus, 


Qe — 1 = 2% — | = (2% — 1)(29O ~ 9) 4 20 4 4 29 + 1) 


so that 2" — 1 is not a prime. 


At the present time twenty Mersenne primes are known and there are, of 
course, twenty corresponding perfect numbers. The known Mersenne primes 
are 


M,=2?-1=3 M,3; =8,191 
M,;=23-1=7 #M,, = 131,071 

M; =31 M9 = 524,287 

M, =127 M3, = 2,147,483,647 
M61, Mgo, M07, M27, M521, M607, M1279 
M203» M2281, M3217, Maas3, Mgar3 


Very little is known about Mersenne primes. We do not know if their 
number is finite or infinite. If it is finite, we have no bounds for the largest such 
number and, consequently, we do not know if the above list is exhaustive. 


EXERCISES 


l 
1. Prove: Ifnisa perfect number, then )) — = 2. 
d|n 
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2. Show that the definition of amicable numbers can be reformulated as: 
m and n are amicable if and only if 
o(m) = o(n) =m+n 
3. Define the sequences P,, and Q, by 
P,=3-2"-1,0,=9:27"- 1-1 (n = 1, 2, ---) 
Show that if P,,_ ,, P,, and Q, are prime numbers, then 
G2 FQ 4P. and b =2"0, 
are amicable. 


4. Use Exercise 3 to find two sets of amicable numbers. 


PROBLEM 


1. Show that if a and b are even integers such that 
o(a) = 2b and o(b) = 2a 
then there exist Mersenne primes 2? — 1 and 2% — 1 such that 


a=21-1(27-1) and b=2?71(22-1) 


6.4* / REMARKS ON ODD PERFECT NUMBERS 


The theory of odd perfect numbers is not as well developed as that of even 
perfect numbers. To this date no odd perfect number has been discovered and 
many mathematicians believe that such numbers do not exist. Unfortunately, 
no one has been able to prove this conjecture. 

Most of the theorems proved about odd perfect numbers state conditions 
that such numbers must satisfy, many of them concerning the prime factors. 
Using these conditions it is sometimes possible to establish other results, such 
as bounds on the number of distinct prime factors, or bounds on the size of 
the numbers themselves. A few of the many established facts are summarized 
below. For a more detailed listing and for references to the following results 
the reader is referred to the survey paper by McCarthy in the list of special 
references. 

Euler obtained the first significant result on odd perfect numbers: If 7 is an 
odd perfect number, the v is of form 


2B1,, 262 


n= pq "'qo"? ** a 
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where P, 91, 42, °**, J; are distinct odd primes and 
p=a=1(mod 4) 


It has since been proved that it is impossible to have all of the B’s equal to 1, 
and Brauer and Kanold have proved that it is impossible to have one of the 
f’s equal to 2 and the others equal to 1. Kanold proved that if B, = BB, =-:- 
= B, = 1, thent > 9andn > 10°°. If, in addition, « = 1, theng, = 3,28, >= 12, 
and t > 13. 

It was established in the last century by Sylvester that n must have at least 4 
distinct prime factors and if n #0 (mod 3), at least 7 distinct prime factors. 
This bound has since been improved. We now know that t = 5 if 3|n and 
t>8if 3) n. Kanold has proved, in addition, that n must have at least one 
prime factor greater than or equal to 61. 

An odd perfect number must be fairly large. Near the turn of the century it 
was established that if 2 is an odd perfect number then n > 2: 10°. This was 
improved to (1.4)-10'* by Kanold. In the 1956 Supplement to his book 
Number Theory and Its History, Ore announced that one of his students, 
J. B. Muskat, had raised this bound to 10'8. 


PROBLEMS 


1. Let p be a prime. Prove that ifm = p*, then 7 is not an odd perfect number. 


2. Let p and q be distinct odd primes. Prove that p%q’ is not a perfect number. 
(Thus, an odd perfect number must have at least 3 distinct prime factors.) 


3. Prove Euler’s result: If 2 is an odd perfect number, then 
n = p"qy?1qz*¥? «+» gy 


where P, 91,92, °°» g; are distinct odd primes and p = « = 1 (mod 4). 


ARITHMETIC 
FUNCTIONS 


7.1 / INTRODUCTION 


In the preceding chapters we have studied three arithmetic functions, oa, t, 
and @, and have observed that they are associated with sums of form ¥° a(d); 
that is, d\n 

> 9(d) =n (Theorem 7 of Chapter 4) 
d|n 


y1=t(n) (Definition) 


d|n 


and ¥:d=o(n) (Definition) 
d|n 


In this chapter we will study arithmetic functions with particular emphasis 
on sums of the above type. A new notation will be introduced in order to 
facilitate our work. 


7.2. / THE CALCULUS OF ARITHMETIC FUNCTIONS 


We now define a “binary operation’’’ on the collection of arithmetic 


functions that will produce a third function from any two given functions. 
We will use a product notation to indicate this operation. 


1A function from the set of all ordered pairs of elements of a set S to S is 
called a binary operation on S. If *“‘O’’ denotes the binary operation, it is 
customary to write ‘“‘aOb=c’’ rather than “* O(a, b) = c.”’ 

The binary operation “‘©”’ is said to be associative if aO(bOc)= 
(a0 b)OCc for all a, b,c in S. It is said to be commutative if aOb=bOa 
for all a, bin S. 

Examples of binary operations are: 

(1) addition and multiplication of numbers (commutative and associative); 
(2) subtraction of numbers (neither commutative nor associative); 
(3) multiplication of nth order matrices (associative, but not commutative). 


102 


7.2 | THE CALCULUS OF ARITHMETIC FUNCTIONS a 103 


m DEFINITION / Let « and f be arithmetic functions. The function a. B 
called the convolution product of « and B, is defined by 


(# » BXn) = ¥ a(") 


d|n 


Example. 


Calculate (9 o 7) (”) (n = 1, 2, 3, 4). 


1 
SOLUTION: (9 0 T)(1) = > v(a)r(5) = g(1)7(1) = 1 
dj 


2 
(0792) = X old)r (5) = 9172) + pr) 


=124+11=3 
(y o T)(3) = w(1)7(3) + v(3)7(1) 
=12421=4 


(p o T)(4) = p(1)7(4) + G(2)7(2) + 9(4)701) 
=1341-2421=7 


The function ¢ © 7 will be considered again in Problem 1 and in Exercise 1. 


The use of this notation leads to a convenient calculus of arithmetic func- 
tions. We will establish several useful properties of convolution multiplication 
that will enable us to manipulate these functions with comparative ease. 


=m THEOREM 1 / Convolution multiplication is commutative; that is, 
Ao B = B o O 


PROOF: Let 1 be a positive integer. As d ranges over the divisors of n, so 
does n/d. Let d’ = n/d. Then 


(%- Bn) = ¥ cade () = > (5) oa 


d\n 


= ¥ Bla’ya( 5) = (B » a) 


d’|n 


Since (a. B)(n) =(B .a)(m) for each positive integer n, the functions 
a. Band B . ware equal. 
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In the next theorem we will establish one of the most important properties 
of convolution multiplication: the associative property. Many of the other 
properties (such as the MObius inversion formula of Section 7.4) are conse- 
quences of this property. 

Before proving Theorem 2 we make an observation that will be used in 
several of the proofs that follow: Many sums can be rewritten as double sums. 


For example, 
Let a and B be arithmetic functions and let m and n be positive integers. Then 


2 a(d)B(D) = 2 a(d) 2B) 
D{n 


PROOF: Let d,, d>, ---, d,and D,, D,, ---, D, be the positive divisors of m and 
n, respectively. Then 


> «(d)B(D) = ye __ Madu) 


d|m J= 1, ) 
D\n k= 1.2... 


= o(d,)B(D,) + a(d2)B(D,) + --- + a(d,)B(D,) 
+ a(d,)B(D2) + «(d2)B(D2) + --- + a(d,)B(D2) 


+ ad, )B(D,) + (ad2)B(D,) + +++ + a(ds)BD;) 
= {a(d,) + ++» + a(d,)} - {B(D,) + -- + BID} 
= yy a(d) > BOD) 


m THEOREM 2 / Convolution multiplication is associative; that is, if «, B, 
and y are arithmetic functions, then 


(40 Blo y= Go (Boy) 


PROOF: We must show that if n is a positive integer, then ((« o B) o y)(n) 
= (4 o (B o y)) (n). If we consider the first of these expressions, we see that 


n 
((a o B) o y(n) = D(a o B)(d) (5) 


= B(Zeor(=))y(9) 


= © Yas6(=),(5) 


d|n s|d 


Since s|d, we let d = st. Thus, the above sum is equal to the single sum 


¥.a(s)B(t)y (“) (7.1) 


st|n 
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We will now consider the expression (a o (B o y))(m). We write 
= Ya(d “\ = Ya(d 
(Bon) = Fata (5) = Zaed)( ¥ Bomn(-")) 


m|n/ 
=¥ ¥ alaponyr(—) 


d|n m|n/d 
This double sum indicates that we consider all positive integers m and dsuch 
that m|(n/d); that is, such that md|n. Thus, we can replace the double sum by 
a single sum and obtain 
n 
(2 » Bon) = Y a(ap(n)r(—™) 
md|n md 
which is identical to expression (7.1) for ((@ o B):0 y)(n) except for the labeling 
of the divisors of n. Thus, 
n n 
(0 B) own) = YralsyB(r(~) = Y a(aBeomy() = Ce = (B » MN 


st|n St md|n 


Since « o (B o y) and (a o B) o y have the same value for each positive integer n, 
the two functions are equal. 


Rather than write « o (B o y) or (@ o B) o y we will write ao Boy. 
We will next prove that the multiplicative functions form a special subset of 
the arithmetic functions. 


= THEOREM 3/ If « and £ are multiplicative arithmetic functions, so is 
Ao B ° 


PROOF: Let m and n be relatively prime positive integers. If dis a divisor of 
mn, d can be written as d = st where s|m and t|n. Obviously, 


(s,t)= =.) = | 


t 
Thus, 


(% © B\(mn) = > a(d)B 7") = or ( *) 


d|mn 
Because « and f are multiplicative, a(st) = a(s)a(t), 
mon m\ .(n 
(2-2) =a 
s t S t 
and the above sum is equal to 


Zaa(>) a(06(*) 


tin 
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which, in turn, is equal to the product of the sums 


Ya96(™) and rate (7) 


Therefore, 


(x » Bymn) = Fia(s)B (”) a(t)B (*) = (Zev (”)) (Sane (*)) 


t|n 
t|n 


= (a o B)(m) « (% o Bn) 
and a o Bis multiplicative. 
One additional definition will be made in this section. 
m DEFINITION / We say that the arithmetic function « is totally multiplica- 
tive if 
a(mn) = a(m)a(n) 


for each pair of positive integers m and n. 
Example. 


The function defined by «(n) =n? is totally multiplicative. The function o is 
multiplicative, but is not totally multiplicative. 


EXERCISES 


1. Let p be a prime number. Show that 


(pot(p)=ptil 
and (p ot)(p?)= p> + pti 


2. LetB =ToT ot. 
(a) Is B multiplicative? 
(b) Calculate B(n) for n = 1, 2, ---, 10. 


3. Let p be a prime and let f be the function of Exercise 2. Calculate B(p) and 
B(p’). 

4. Let a and B be totally multiplicative functions. Does « o B have to be multi- 
plicative? Totally multiplicative ? 
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PROBLEM 


1. Prove: got =o. [Hint: See Exercise 1.] 


7.3 / INVERSE FUNCTIONS UNDER CONVOLUTION 
MULTIPLICATION 
m DEFINITION / The function ¢ is defined as follows: 


e(1) = 1 
e(n) = 0 ifn>1 


The function ¢ is called the identity for convolution multiplication since 
EcoXA=HoE=4 (7.2) 
if « is an arithmetic function (see Exercise 2). 
We see from (7.2) that the function ¢ plays a role in convolution multiplica- 


tion that is similar to the role played by the number | in ordinary multiplica- 
tion. We are thus led to the following definition: 


= DEFINITION / Let « and £ be arithmetic functions. B is called the inverse 
of « (under convolution multiplication) if 


Xo B =€E 
This fact is indicated by writing B = a7!. 
Thus, «~! is a function having the property «7 '.a=a.a ' =e. It is 


easy to show (Problem 1) that if a has an inverse, it is unique. Consequently, 
we may speak of the inverse of «. In Theorem 4, we establish necessary and 
sufficient conditions which insure that an inverse will exist. 


= THEOREM 4/ The arithmetic function « has an inverse if, and only if, 
a(1) #0. 


PROOF: Suppose «(1) # 0. Define the function f inductively by 


1 
a 


—— if : 
B(n) ma 7 Enos *) ifn>t (7.3) 


BA) = 
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We will prove that B = a~*. If n = 1, then 


(Bo a)(1) = B(L)a(1) = 1 = €(1) 
Ifn > 1, then 


(B« a(n) = ¥ BCd)a() = » ada ) + Benya) 


= ¥ plda(=) + = 7 5 Ses Nx 


= 0 = e(n) 
Since (B o «)(n) = (a o B)(n) = e(n) for each positive integer n, then 
Qo B = B of — E€ 


Thus, B =a7!. 
The proof of the converse is left for the reader (Exercise 4). 


= THEOREM 5/ If @ is a multiplicative function, not identically zero, then 
a! exists and is multiplicative. 


PROOF: Since « is not identically zero, then a(1) = 1. Thus, from Theorem 4 
[using (7.3)], it follows that a~! exists, that 


a~*d)=1 
and 
ohn, 
a~*(n) = i (@o(*) ifn>1 (7.4) 


We must now prove that if a and 5b are relatively prime positive integers, 
a~1(ab) = a~1(a)x~ *(b). Suppose there is a pair of relatively prime positive 
integers a and b such that «~‘(ab) # «~‘(a)a~‘(b). From the well-ordering 
principle we know that there exists a pair of relatively prime positive integers 
with this property such that their product is the smallest element in the set of 
all such products. Let m, n be this pair. Then 


(m, n) = 1, a *(mn) 4 a *(m)a~ *(n) 


and if c and d are relatively prime positive integers such that cd < mn, then 
a~*(cd) = a7 *(c) a *(d). It is obvious that neither m nor n is equal to 1. 
Consider the quantity «~'(m) «7 *(n) — a~*(@mn). From (7.4) we obtain 


a *(m)a~*(n) — a7 *(mn) = a7 "(ma *(n) + > a t(da(™") 


ad<mn 
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If djmn, we can write d= st where s|m and s|n. We can now rewrite the 
above equation as 


a” *(m)a~*(n) — a (mn) = a (ma "(ny + a 'sta(™ | ") 


tin 
st<mn 


Because st < mn, then a” '(st) = «7 '(s) a~*(t). Using the fact that «(1) = 1, 
we obtain 


a~*(m)a~*(n) — a7 *(mn) = a (man) + a ¥a-*@a(™)a(*) 


st<mn 


= a7 }(m)a(1)07 1(n)a(1) 


+ d a *)a(™)a~ ‘(o2(") 


st<mn 


ms & o. H)a(™)ar ‘(t)o (*) 


“(Ee ro()) Revo() 


= (a7 * 0 a)(m) + (a * o a)(n) 
= €(m)° e(n) = 0 


since m and n are not both equal to 1. 

Thus, a~ (mn) = «~*(m)a~1(n), which contradicts the choice of m and n. 
Thus, «~'(ab) =a *(a)x~*(6) for any pair of relatively prime positive in- 
tegers, a, b; that is, «~' is multiplicative. 


EXERCISES 


1. Let a(n) =n’. Calculate «~1(n) for n = 1, 2, --, 10. 


2. (a) Show that €0.0=a.e€e=a for any arithmetic function «. 
(b) Show that «~! =e and (a ')7! =a. 


3. Suppose a and f have inverses. Show that a » B also has an inverse and that 
(a oO B)~? = at fe) pi 


4. Complete the proof of Theorem 4 by proving: 
If «(1) = 0, then no function Pf exists such that « o B =e. 
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PROBLEMS 


1. Prove that the inverse of a function is unique; that is, if «. B =e and 
aoy =e, then B =y. 


2. (Requires knowledge of elementary modern algebra.) Let G be the set of all 
arithmetic functions « such that «(1) 4 0. Let H be the set of all elements of G 
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which are multiplicative. Show that G is a group under the operation “‘.”’ and 
that His a subgroup of G. 


3. Define the binary operation ‘“‘*”’ on the arithmetic functions by 
(a * B)(n) = p64) 


(a) Prove:a#B=fB *«a. 

(b) Prove: If « and f are multiplicative, so is a * B. 

(c) Show by an example that the operation “‘*”’ is not associative. 

(d) If wand a « B are multiplicative, does it follow that 8 is multiplicative ? 


4. Use Problem 3 to prove: If n = p,*' + p,*?--- p,"*, where the p; are distinct 
primes, then 


Ro Pe pomti+i 


do(d) = Y 
OO pit+l p2t+1 P+ 1 


[Elementary Problem E1725, American Mathematical Monthly, Vol. 71 (1964), 
p. 912.] 


7.4 / THE MOBIUS INVERSION FORMULA 


The following problem was solved by A. F. Mébius (1790-1868), a student 
of Gauss. 


Suppose the arithmetic functions a and B are related by the following formula: 
For each positive integer n, 


a(n) = >) B(d) 
d|n 
Is it possible to ‘invert’ the series and express B as a function of a? 


In order to solve this problem we introduce several basic functions that can 
be used to construct more complicated ones. 
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= DEFINITION / The functions y, 1, and yp are defined as follows: 


v(n) = 1 for each positive integer 7, 

1(n) = n for each positive integer n, 
-1 
w=v. 


Thus, for example, t=v.ovando=1t1ov. 
The problem of Mobius can now be restated as follows: Given that « = B o v, 
express B in terms of a. Using the theory developed in this chapter, it is trivial 


that 
B=aov '=aon 


Thus, 


n 
pin) = » aay") 
d|n 
for each positive integer n. (The M6bius Inversion Formula.) 


The function p, called the Mébius function, plays a role in the solution of 
many advanced problems in number theory. In the next theorem we obtain 
a formula for the exact value of this function. 


s THEOREM 6/ (1) =1, 
y(n) = 0, if n is divisible by a square, 
H(PyP2 °° Py) = (- 1)‘, if the primes 
Pi> P2> °**> Py are distinct. 


PROOF: From (7.4) it follows that u(1) = 1 and 
n 
un) = — » «ar(5) =~ Daa) (7.5) 
d<n d<n 


Thus, if p is a prime number, 


u(p) = — Ze) = —p(1)= —-1 


Because p is multiplicative (it is the inverse of a multiplicative function), then 
if P,, Po, °°» Py are distinct primes, 


M(PiP2 °° Px) = HPs) M(p2) «** M(p,) = (— 1)" 
From (7.5) it follows that 
u(p?) = — {u(1) + u(p)} = 0 
u(p*) = — {u(1) + w(p) + u(p”)} = 0 
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If we continue by induction on n, we see that p(p") = 0 (n = 2, 3, ---). (The 
details are left to the reader.) Since yp is multiplicative, it follows that if is 
divisible by a square, then p(n) = 0. 


The following theorem establishes a connection between certain sums 
involving a o 8 and sums involving « and f. From this theorem a number of 
interesting relationships between arithmetic functions can be derived. We 
state two of these relationships in the corollaries, some others are in the 
exercises. 


= THEOREM 7 / Let « and B be arithmetic functions. Then 


n n [n/d] n  [n/d] 
(as Axim) = ¥ (aca) ¥ 9) = YY aeewy 


PROOF: 2, Co o B\(m) = De. » a(d)p (") 

Each integer d in the second sum is in the range from 1 to n, and each 
integer in that range occurs as such a dat least once. If we fix d, we will obtain 
a term «(d) B(m/d) for every multiple m of d, 1 < m <n; that is, a term a(d) 
B(kd/d) for every k between 1 and n/d (inclusive). If we sum over all such d, 
and, for fixed d, all such k, we obtain the desired result. 


s COROLLARY 7.1/ ¥ pm) “| =1 
m=1 m 
PROOF: Let « = pn, B = v in Theorem 7. 


= COROLLARY 72/ Y cdm= ¥ [=] | 
m=1 m=1L.m 
PROOF: Let « = B = vin Theorem 7. 


Corollary 7.2 can be used to establish that 


y em) 


is “approximately equal” to 1 - log n. Consequently, for large values of n, the 
sum behaves quite regularly although 7 is itself rather irregular. This approxi- 
mation is a consequence of the fact that the second sum in the statement of 
Corollary 7.2 is an exact count of the lattice points in the first quadrant that 
lie on or under the curve y = n/x. It can be proved that in this case, the number 
of lattice points is approximately equal to the area under the curve for 
1 < x <n. The proof is comparatively difficult and is omitted in this book. 
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EXERCISES 


1. Prove: 
Yud@)=0, ifn>1. 


2. Prove: 
n 

»; au (=) = o(0). 

d|n 
(See Theorem 7 of Chapter 4.) 
3. Use Exercise 2 to prove that g o t =a. (See Problem 1 of Section 7.2.) 
4. Prove: 

d = ni? 
Ii 


5. Use Theorem 7 to prove that 


Y o(m) = 


2» m[n/m]. 


m= 


PROBLEMS 
1. Let a be a totally multiplicative function, «(1)40. Prove that 
a~4(n) = a(n)u(n) for each positive integer n. 
2. Prove: If f(n) > 0 for each positive integer n and 

g(n) = Ly (d) 
then 

fa) =[[aar 
d|n 


[ Hint: Use logarithms. | 


7.5* / THE RING OF ARITHMETIC FUNCTIONS 
This section has as prerequisite: a knowledge of elementary modern algebra. 


We have previously defined the “‘ product” of two arithmetic functions. We 
now define the “‘sum”’ of two functions. 


m DEFINITION / Let « and £ be arithmetic functions. The function « @ B is 
defined by 


(% © B)(n) = a(n) + Bin) 
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= THEOREM 8 / The arithmetic functions form an integral domain under 
the operations © and o. 


PROOF: We must show that the functions have the following properties 
under the two operations: 


(1) @@B) Oy =a OG (BOY). 

(2)a@p=fp@a. 

(3) There is a “‘zero”’ function, denoted by 0, such that « © 0 = « for each 
function «. 

(4) If « is an arithmetic function, then a has an “additive inverse”’ B such 
thata® p= B@Oa =8. 

(5) ao (Boy) =(@o B) oy (proved in Theorem 2). 

(6) a. B = B.a (proved in Theorem 1). 

(7) ao(B@y) =a. PB @a oy (distributive property). 

(8) There is a “multiplicative identity”’ (the function e). 

(9) Ifa. Bp =a.yanda #68, then B =» (the cancellation law). 


If we define 6 by 
O(n) = 0 (n = 1, 2, ---) 


then 0 has property (3). For (4) we define the function B by 
B(n) = — a(n) (n= 1; 2, see) 
Except for (9) the proofs of the other properties only involve manipulation. 
PROOF OF (9): Proving the cancellation law is equivalent to proving that if 
ao Bf =8, then a = 6 or B = 0. Suppose that « 4 6 and B # 6. Let mand n be 
the smallest positive integers such that a(m) #0 and f(n) #0. Let d be a 


positive divisor of mn. If 1<d<~m, then a(d) =0, and, if m<d< mn, 
B(mn/d) = 0. Therefore, 


(#- Byimn) = ¥a(aB(™") = a(mB(n) # 0 


It follows that if « . B = 0, then either « or B must be 0. 


EXERCISES 


1. Prove: (a®B)@y =a @G(PO@y). 
2. Prove:a@®B=BOa«. 
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3. Prove: If « and B are multiplicative functions, neither equal to 0, then 
a@® B is not multiplicative. 


4. Prove: 0.0 = @ for each «a. 


5. Use the cancellation law to prove: Ifa.a=e,thena=+e. 


PROBLEMS 


OL 
1. We define = to be the arithmetic function y if y is the unique solution of 


Boy =. 


(a) Prove that if y is a solution of B . y =a, then it is the only solution, pro- 
vided B # 0. 


oL 
(b) Prove that if B(1) 4 0, then 7 exists. 


2. Prove the following “laws of fractions’’: 


a 
(a) ==€; 
a 
(b) = 0 tod 
et. ae a 
os aAop~y® fr) 
oe 
as 


ELEMENTARY RESULTS 
ON THE DISTRIBUTION 
OF PRIME NUMBERS 


8.1 / INTRODUCTION 


Some of the deepest investigations in number theory are concerned with the 
distribution of prime numbers. Most of these researches are in the realm of 
analytic number theory (the applications of real and complex analysis to 
problems in number theory) and are thus outside the scope of this book. 

In this chapter a few of the elementary results on the distribution of prime 
numbers are proved and some of the deeper results are stated without proof. 
In the appendix some techniques of factorization are discussed. 


8.2 / THE INFINITY OF PRIME NUMBERS 


One of the oldest surviving proofs in number theory is the following proof of 
an infinity of prime numbers, due to Euclid. 


=m THEOREM 1 / There is an infinity of prime numbers. 


PROOF: Suppose the set of primes is finite. Let p,, p2, ---, p, be all of the 
primes. Form the number 


mM = Py Pr" Pn — 1 
Since m is not divisible by any of the prime numbers in the above list, then m is 


a composite number that is not divisible by a prime number, which contradicts 
the unique factorization theorem. Thus, the number of primes is infinite. 


Several modifications can be made in the above proof that enable us to 
prove related results. One such modification proves that if p, is the kth prime 
in the natural ordering (that is, p, = 2, p. = 3, p3 =5, -:-), then 


Pe+1 SPiP2°' Pe — 1 
(Problem 2). 
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We can prove a second modification: the residue class 2, (mod 4) contains 
an infinity of primes. Before proving this result, we observe that since 2 is the 
only even prime, all other primes are in #, (mod 4) or &; (mod 4). 


=» THEOREM 2 / There is an infinity of prime numbers that are congruent 
to 3 (mod 4). 


PROOF: Suppose the number of such primes is finite. Let 


P1> P2>°**> Pn 


be the prime numbers in &@, (mod 4). Form the number 


m=4D, Pr Pa — | 


Then m is odd and is not divisible by any of the primes in our list. Since m is 
congruent to 3 (mod 4), then m is composite. Thus, m has prime factors, all of 
which are congruent to 1 (mod 4). But the product of numbers congruent to 1 
(mod 4) is also congruent to 1 (mod 4), and so 


1 = m= 3 (mod 4) 


which is impossible. Thus, #3 (mod 4) contains an infinite number of primes. 


Does the residue class 2, (mod 4) also contain an infinity of prime num- 
bers? The answer is affirmative, but unfortunately, the proof is not as simple 
as that of Theorem 2 (see Section 9.6). 

A more general problem is the following: Let a and m be relatively prime 
positive integers. Does the residue class 2, (mod m) contain an infinity of 
primes? The following theorem answers this question completely. The proof 
is beyond the scope of this book. 


=» THEOREM OF DIRICHLET / Let a and m be relatively prime positive 
integers. There is an infinity of prime numbers in the arithmetic progression 
{a,a+m,a+2m,a+ 3m, ---} 
Given the theorem of Dirichlet, we might suppose that the prime numbers 


are distributed among the integers in a regular pattern. It is easy to see, how- 
ever, that this is not the case. Suppose 7 is a positive integer. Then 


fn+1)!+2,n+1)!43,(n+D!4+4-,@4+ I!+(4+ I} 


is a set of nm consecutive composite numbers. Thus, the prime numbers are not 
distributed in a regular pattern. 
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EXERCISE 


1. Prove that there is an infinity of prime numbers in the residue class #, 
(mod 6). 


PROBLEMS 


1. Where does the proof break down if we try to modify the proof of Theorem 
2 to prove an infinity of primes congruent to 1 (mod 4)? 


2. Modify the proof of Theorem 1 to prove that if the Ath prime is p, then 
Pett. SP1P2°** Pe - 1 


8.3 / THE SIEVE OF ERATOSTHENES 


One of the most efficient methods of constructing tables of prime numbers 
is the sieving process, invented by the Greek mathematician Eratosthenes 
(276-194 B.c.). The work involved in the process is simplified if we make the 
following observation: 

If the integer n is composite, it must have a prime factor that is less than or 
equal to ./n. If this were not the case, we could write n = ab where a > ./n and 
b > Jn. This would imply that n = ab > Jn my n =n, which is impossible. 

To illustrate the method we will construct a table containing all primes less 
than 130. 

We begin by listing all of the numbers from 2 to 129. See Table 8.1. (Since 1 
is not a prime it is omitted.) 

The first number in our list, 2, must be a prime and no multiple of 2 except 2 
itself can be a prime. We remove all multiples of 2 (except 2 - 1) from our list 
and place the symbol “‘—.”’ above the position. The next remaining number, 3, 
must be a prime so we delete the multiples of 3, placing the symbol “‘~”’ above 
the position. We now delete the multiples of 5(“‘ - ’’), the multiples of 7 (“ =”’), 
and the multiples of 11 (“‘~’’). (Our work is simplified if we notice that the 
first multiple of 5 to be removed is 25, the first multiple of 7 is 49 and the first 
multiple of 11 is 121.) 


Because the largest prime less than er 130 is 11, it follows from the observa- 
tion preceding the example that the remaining numbers must be primes. Thus, 
the primes less than 130 are 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 
53, 59, 61, 67, 71, 73, 79, 83, 89, 97, 101, 103, 107, 109, 113, 127. 

Observe that we have constructed a factor table as well as a table of primes. 
For example, we see that the smallest prime factor of 119 is 7. 
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The sieve of Eratosthenes can be used to derive a formula for the number of 


primes less than or equal to n if the primes less than or equal to 4/ n are known. 
In order to use the standard notation we define the function z. 


ws DEFINITION / Let x be a real number. x(x) is the number of prime 
numbers less than or equal to x. 


Table 8.1* 

= 2 3 4 5 6 7 B 9 
JO Wo 12 13 14 %I5 16 17 18 19 
20 21 22 23 24 25 26 27 28 29 
30 31) 3233 4 35 86 8 88D 
40 41 42 43 44 45 46 47 48 49 
50 51 52 53 54 55 56 57 58 59 
60 61 6 6 64 6 6 67 6 69 
7 WU 2 73 4 5 1% 77 2% 79 
80 81 8 83 84 85 86 87 88 89 
90 91 92 93 _94 95 96 97 98 99 
100 101 102 103 104 105 106 107 108 109 
10 «U1 112s 18 14S 6ssd117—s—s8—s“*'9 
120 121 122 123 124 125 126 127 128 = 129 


*The symbol — indicates that the number is a multiple of 2; ~, a multiple 
of 3; -, a multiple of 5; =, a multiple of 7; ~, a multiple of 11. 


Example. 
m(—5) = m(114) = 0, 
n(2) =7(2.7) =1, 
mi2) =7(il) =5S. 


If p1, Do. °**, P, are the primes less than or equal to n, the formula for x(n) is 


certo [2] -[E]+~-L 
7 tL] ul Lin] say Be - Lin] — (8.1) 
r Lal) 7 \Lioa aa Feed) a 


et pn) 
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To prove this formula, we recall the steps in the sieve of Eratosthenes. The 
number of elements in the original set is n — 1. The number divisible by p, is 
[n/p,], the number divisible by p, is [n/p,], and so on. If we delete all numbers 
divisible by p, (including p, itself) and then all numbers divisible by p., we 
have deleted [n/p, ] + ({n/p2]— [n/p,p2]) numbers. (The [n/p, p, ] numbers are 
those divisible by both p, and p, and were thus deleted at the first stage.) If 
we continue this line of reasoning. we obtain (m — 1) plus the complicated 
expression involving the greatest integer function for the number of elements 
remaining in the set. In the process we have removed the prime numbers p,, 
..-» D, from the list as well as all of their multiples. When we replace these 
numbers, we obtain (8.1). 

As a check on the formula we calculate the number of primes less than or 


equal to 129. In this case 
n(./n) = n(,/129) = 5 


The primes less than or equal to ff 129 are 


2, 3, 5, 7, and il 
From the formula we obtain 


comers )-[2]-[2)-[2 
EELS 
age ee 
‘Leta ffal-b-Leead-b 
saul? id Fer cs a+” s|+ Goan ae rear ba bers oa Peas 
derail  Eectacr fecoon 


=129-14+5—64—43 —25-18—11+ 21+ 12 
LOLSeREC EI PIA a 
~1—0-—1-0-0-04+0+0+0+0+0-0 
= 31 


We can verify that this is correct by counting the primes in our table. 
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Although the formula is very awkward to use, it is the only formula for the 
exact value of x(n). In actual computation there are a number of shortcuts 
that may be used but the process remains cumbersome. By use of the formula it 
has been established that 


7(100,000,000) = 5,761,455 
and 


n(1,000,000,000) = 50,847,478 


EXERCISES 


1. Use the sieve of Eratosthenes to calculate all prime numbers less than 200. 


2. Use formula (8.1) to calculate (200). 
3. Prove: If x(/ n) = k, then formula (8.1) has 2* + 2 terms. 


8.4* / REMARKS ON THE DISTRIBUTION OF PRIME 
NUMBERS 


Until the problem was considered by Gauss, little of significance was 
determined about the distribution of prime numbers. Much of the effort 
before that time had been wasted on attempts to find polynomials that generate 
all of the primes or take on only prime values.’ For example, it was believed 
by certain mathematicians in the Middle Ages that the polynomial 


f(x) =x? —-x4+4l 


takes on only prime values as x varies over the nonnegative integers. Although 
this is the case for x = 0, 1, 2, ---, 40, it is easy to see that 


fl) =41? 


which is composite. It is now relatively simple for us to prove that any integral 
polynomial must take on composite values (see Problem 4, Section 3.6). 
Thus, the attempts mentioned above were doomed to failure. 


1It has been proved theoretically that prime-generating functions (not poly- 
nomials) do exist, but to date no one has exhibited a nontrivial function of 
this type. As an example, W. H. Mills (Bulletin, American Mathematical 
Society, June, 1940) proved that there is a real number « such that [«*"] takes 
on only prime values. 
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Even Fermat attempted to find a prime-generating function. He believed 
that any number of form 


fy = 2" +1 (n =O, 1, 2, ++) 


is a prime, but had to admit that he could not prove it. The proposition is 
correct for n = 0, 1, 2, 3, 4, but by factoring /;, Euler showed that the general 
proposition is false. In this case, Fermat’s intuition was quite wrong, because 
no additional primes of this type have ever been found, even after extensive 
searches with electronic computers. 


THE PRIME NUMBER THEOREM 


Gauss changed the direction of research on prime numbers by instituting a 
study of x(x), the number of primes not exceeding x. While studying prime 
tables, he noticed that as x becomes large, (x) behaves similarly to x/log x 
(the logarithm to the base e), and he later conjectured that 


nee 
x20 X/logx — 


This proposition, known as the prime number theorem, is usually written 
using the abbreviated notation 


RA: log x 

[We say that x(x) is asymptotically equal to x/log x.] Note that this does not 
imply that the difference x(x) — x/log x ever becomes small, but only that it 
comes small compared to x/log x. 

The first major step towards a proof of this theorem was made by the 
Russian mathematician Chebychev, who, in 1850, proved that positive 
constants a and 5 exist such that 
a yeh (x > 2) 


7 log x log x i 


In 1896, Hardamard and Vallee-Poussin independently proved the prime 
number theorem. The proofs depended on deep investigations in complex 
analysis, starting with Riemann’s zeta function” 


1 1 1 
= | ans ae! — — eee 
C(s) Poe tiag Haat (s > 1) 


2The zeta function is intimately connected with prime numbers because 


Le Aek 1 1 
W)=(1+ 545+ ~)(14 54 ate --) 


the product extending over all prime numbers. 
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For years it was believed that no elementary proof of the prime number 
existed, but in 1950 P. Erdés and A. Selberg were able to prove this theorem 
using nothing more advanced than elementary calculus. 


PRIMES IN ARITHMETIC PROGRESSION 


As mentioned in Section 8.2, Direchlet proved that if (a, m) = 1, the residue 
class 2, (mod m) contains an infinity of prime numbers. It is natural to inquire 
whether certain residue classes are “‘richer”’ in primes than are others. Vallee- 
Poussin established that this is not the case by proving a “‘prime number 
theorem for primes in arithmetic progression.’’ He proved that if (a, m) = 1 
and 7, (x) denotes the number of primes of form a+km, which do not 
exceed x, then 

Lt x 
p(m) log x 

In other words, over a large finite interval, all of the residue classes rela- 
tively prime to m contain “‘approximately”’ the same number of primes. 


Tea, m(X) out 


BERTRAND’S CONJECTURE 


On the basis of tables J. Bertrand conjectured in 1845 that ifn > 3, there isa 
prime number between 7 and 2n — 2. It is easy to verify this for small values of 
n. For example, 


5 is between n and 2n — 2, ifn =4 

7 is between n and 2n — 2, ifn =5 or 6 
11 is between n and 2n — 2, if n = 7, 8, 9, 10 
19 is between n and 2n — 2, if n = 11, 12, ---, 18 
31 is between n and 2n — 2, if n = 19, 20, ---, 30. 


This conjecture was proved soon afterwards by Chebychev, who also proved 
the following stronger statement: 

If « >, there is an integer N (which depends on €) such that if x is any in- 
teger greater than N, there is a prime number between x and x(1 + €). 

Chebychev’s result has since been improved so as to remove the restriction 
that « > 14. We now know that the proposition is true for any positive num- 
ber €. 

If ¢ = 1, we obtain from Bertrand’s conjecture and direct verification that 
there is a prime number between x (exclusive) and 2x (inclusive).° 


PRIME TWINS 


Two consecutive odd primes are called prime twins. For example, the first 
few prime twins are 3 and 5, 5 and 7, 11 and 13, 17 and 19, 29 and 31, 41 and 
43. 


>The bound 2x is needed only for x = 1. 
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Is there an infinity of prime twins? This is one of the oldest problems in 
mathematics. We have some information about the distribution of prime 
twins, but the above question is still unsolved. It has been proved that the 
series 

1 1 1 
es + as + poe + coe 
Pi P2 P3 


is divergent when summed over all prime numbers, while the series 


: a re | 

ects + — + peetces + eee 

41 42 43 
is convergent when summed over all prime twins. Thus, the prime twins are 
comparatively rare in the sequence of all prime numbers. 

Many results in number theory were first conjectured on the basis of 
plausible arguments that could not be developed into rigorous proofs. There 
is an argument of this type that seems to indicate that the number of prime 
twins is infinite and that a “‘prime number theorem” holds for prime twins. If 
we let a(x) denote the number of prime twins that do not exceed x, the 
argument leads to the conjecture that 


cx 
log? x 


a(x) ~ 


where c is a certain positive constant. The conjecture is obtained from 
theoretical considerations and agrees quite well with actual counts made by 
electronic computers. 


GOLDBACH’S CONJECTURE 


In a letter to Euler (in 1742), Christian Goldbach stated that from empirical 
observations he believed that every even integer could be expressed as a sum 
of two odd primes (at that time the number | was counted as a prime number). 

It has since been verified that every even integer n in the range 6 <n < 
100,000 is the sum of two odd primes (excluding 1). This is easy to check for 
small even integers. For example, 


6=3+ 3 22=3+19 
8=3+ 5 24=5+19 
10=5+ 5 26 =7+ 19 
I2Z=7+ 5 28 = 5 + 23 
144=7+ 7 30 =7 + 23 
16=5+11 32 =3 +29 
18=7+ 11 24=5+ 29 
20 =7 + 13 36 = 7+ 29 
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Although many mathematicians have worked on this conjecture, it has 
never been proved or disproved. The first significant advance was made by the 
Russian mathematician Vinogradov, who, in 1937, obtained a partial solution 
to a consequence of Goldbach’s conjecture. Note that the conjecture implies 
that every odd integer greater than 9 is a sum of three odd primes. Vinogradov 
proved that every sufficiently large odd integer is the sum of three odd primes; 
that is, there is a number N such that any odd integer greater than N is the 
sum of three odd primes. Theorems of this type are useful if we can get a 
reasonable upper bound on N, because we can check the remaining numbers 
and see if the proposition holds for them. Unfortunately, the value obtained 
for N is so large that the remaining numbers can not be checked even on an 
electronic computer. 


APPENDIX / FACTORIZATION METHODS OF FERMAT 
AND EULER 


A. INTRODUCTION 


Many problems in number theory require the factorization of some given 
integer. The only efficient factorization method is by the use of factor tables, 
but, unfortunately, it is not possible to include all of the integers in these 
tables. Consequently, we must rely on other, less efficient, methods. 


The most obvious technique is to divide the primes less than ./ N into N and 
see which, if any, leave no remainder. This works well if N is small, but is 
impractical if N is large, since we probably will not know which of the 


numbers less than FY N are primes. It is also obvious that the labor is prohibi- 
tive. 

There are a number of methods for factoring large numbers. Because many 
of them are variations on two basic methods, each of which having a compara- 
tively simple proof, we will restrict ourselves to these two methods—one due 
to Fermat and one to Euler. 

As we shall see, each of the methods involves recognizing when certain 
numbers are squares. In order to facilitate our work, we construct a table 
that can be used to eliminate many nonsquares from consideration. 

If we compute the squares of 0, 1, 2, ---, 9, modulo 10, we see that 


07=0 

(?7=97=1 

27=87=4 (mod 10) 
77 =9 

47=67=6 


575 
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Thus, if an integer ends in one of the digits 2, 3, 7, or 8, we see that it is not a 
square. Consequently, we can eliminate approximately 40 percent of the 
numbers from consideration. 

If we repeat this process for the modulus 100, we find that a square must end 
in one of the pairs of digits listed in Table 8.2. 


Table 8.2 Table of last two digits of squares. 


00 16 21 36 4] 56 61 76 81 
01 24 44 64 84 
04 25 49 69 89 
09 29 


By checking the last two digits against the 22 entries in the table, we see that 
we can eliminate approximately 78 percent of the numbers from consideration. 


B. FERMAT’S METHOD 


Fermat’s factorization method starts from the observation that if N is a 
difference of two squares, say N = x* — y”, then N can be factored as 


N = (x — y)(x + y) 


The method depends on the fact that if N is odd,* the converse is also true. 


m THEOREM 3/ Let N bean odd integer. If N can be factored as 
N=ab(1 <a<b) 
then N can be written as a difference of two squares x* and y”, where 


wes sa _b-a 
~ 9 es, 


The proof follows immediately from the expansion of 


2 2 


In order to apply the theorem we note that if N is the difference of two 
squares, we can write 


x? —~N=y?* 


“The factors of 2 are obvious, so we may assume that N is odd. 
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We now substitute values of x starting with x = [J/ N] + 1. If N is composite, 
we will eventually find a value of x for which x? — N is a square. 
Rather than compute x”, (x + 1)”, (x + 2), ---, separately, Fermat noted 
that the method is easier to apply if we use the following scheme: 
(x + 1)? —N=(x? —N)4+ (2x41) 
(x + 2)? — N= [(x + 1)? — N] + (2x + 3) 
(x + 3)? — N= [(x + 2)? — N] + (2x + 5) 


The following example is the one used by Fermat to illustrate his method: 
Example I. 


Factor N = 2,027,651,281. 
SOLUTION: Since [V NV] = 45,029 we start our calculations with x = 45,030. 


x x?—N 

45,030 49,619 
2: 45,030+ 1= 90,061 
45,031 139,680 
2 - 45,030 + 3 = 90,063 
45,032 229,743 
2+ 45,030 +5 = 90,065 
45,033 319,808 
2° 45,030 + 7 = 90,067 
45,034 409,875 
90,069 
45,035 499,944 
90,071 
45,036 590,015 
90,073 
45,037 680,088 
90,075 
45,038 770,163 
90,077 
45,039 820,240 
90,079 
45,040 950,319 
90,081 

45,041 1,040,400 = 1020? 
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Thus, 
N = 45,0412 — 1020? = (45,041 — 1020)(45,041 + 1020) 
= 44,021 - 46,061 


In the above list we can eliminate all values of x? — Nexcept 499,944 and 1,040,400 
by checking our table of last digits of squares. 


Several short cuts are available. For example, it is easy to eliminate the 
factors 3, 5, and 11 by inspection.° If we eliminate all of the primes less than 
the prime p as factors, we must only check 


N + p? 

7-N  forx=[,/N | ] 
x or x [./N J+ ap 
(see Exercise 2). If we check the last two digits of N against the table of squares, 
we can eliminate many values of x from consideration. In the above example, 
because N ends in 81, we see that x? must end in 81 and y? in 00, or x? in 25 
and y” in 44. Thus, x must end in 09, 41, 59, 91, 05, 15, 25, 35, 45, 55, 65, 75, 
85, or 95. 


EXERCISES 


1. Use Fermat’s method to factor 
(a) 3,085,811; 
(6b) 3,642,517. 


2. Prove: If the smallest prime factor of N is greater than or equal to c, then 
in Fermat’s method we do not need to check x? — N for x greater than 
(N + c*)/2c. 


C. EULER’S METHOD 
In Chapter 10 we will prove the following theorem (Theorem 4): 


Let the positive integer N be written in the form N = mn, where n does not 
contain a square factor. N can be written as a sum of two squares (one may be 
07) if, and only if, each odd prime factor of n is of form 4k + 1. 


Using this theorem we can prove a factorization method commonly 
ascribed to Euler.© The method is more limited than Fermat’s, because it can 


>N has 3 as a factor if and only if the sum of the digits is congruent to zero 
(mod 3), and has 11 as a factor if and only if the alternating sum of the 
digits is congruent to zero (mod 11). 

*The method was apparently known to both Mersenne and de Bessy, but 
Euler was the first person to use it extensively. 


APPENDIX | FACTORIZATION METHODS a 129 


be applied only to numbers of form 4k + 1. Furthermore, in certain cases we 
cannot factor the number in question, but we are merely informed that the 
number is composite, with an even number of factors of form 4k + 3. 

An immediate consequence of the above theorem is: 


If N is a positive integer of form 4k + 1 that cannot be written as a sum of 
two squares, then N is composite and has an even number of prime factors of 
form 4k + 3, at least two of which are distinct. 


We now turn our attention to those integers of form 4k + 1 that can be 
written as sums of two squares. [Note: If N = a” + b*, where a and b are not 
relatively prime, then (a*, b*) is a factor of N. Thus, in the sequel, we need only 
consider the case N = a? + b*, where a and Db are relatively prime.] 


=m THEOREM 4/ Let N be a positive integer of form 4k + 1 which can be 
written as a sum of two relatively prime squares. Then N is composite if, and 
only if, N can be written as a sum of two squares in two distinct ways. 


PROOF: (Part I) Suppose N is composite. If N is a square the proof is 
trivial. If N is not a square, we can write N = ab (1 <a, 1 < b), where we may 
assume that a has all of the prime factors of form 4k + 3. Then 5 has only 
prime factors of form 4k + 1, and so we can write b as a sum of two squares: 


b=x?+y? 


We now distinguish two cases. 
Case I. If ais a square, say a = t?, then 


N = ab = (tx)* + (ty)* 


Because N can also be written as a sum of two relatively prime squares, there 
are two ways of writing N as a sum of two squares. 
Case IT. If ais not a square, it follows from the theorem quoted above that 


a=m*a' 


where a’ has only prime factors of form 4k + 1. Thus, a can be written as a 
sum of two nonzero squares, 
a=s*+t? 


If either x or y 1s zero the proof is as in Case I. If neither x nor y is zero, on 
substituting, we obtain 


N = ab =(s* + t?)(x? + y”) 
= (sx + ty)? + (sy — tx)? 
= (sx — ty)’ + (sy + tx)? 
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Since N 1s odd, it follows that |sx — ty| # |sy — tx|, and so N can be written 
in two ways as a sum of two squares. 

(Part II) Suppose N can be written in two distinct ways as a sum of two 
squares: 


N=a?4+b* =c? +d? 


In each sum one of the squares must be even and the other odd. Suppose a and 
c are even. Then 


g-e@=g@—h 
and so 
(a—c\(a+c) =(d— b\d+b) (8.2) 


If D =(a—c, d—b), Dis even, since a — c and d — bare even. We now write 
a-—c=Ds d—b=Dt (s,¢t)=1 
Substituting into (8.2) and cancelling D, we obtain 
s(a+c)=t(d+ 5) (8.3) 


which implies that 
s|(d + b) 


since (s, t) = 1. We can now write 
d+b=su 
and substitute into (8.3) obtaining 
atc=tu 
Because s and ¢ are relatively prime, 
u=(a+c,d+b) 
and so uw is even. It now follows that N is composite since 
N = 4Q2N + 2N) = 42a? + 2b? + 2c? + 2d?) 
= Y[(a-—c)* + (atc) +(d—b)? +(d+b)7] 
= Y4(D*s? + t?u* + D?t? + s*u*) 
= 4(D? + u*)(s* + t’) 


[2+ }eo 
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In the last part of the proof we did not use the fact that N can be written as a 
sum of two relatively prime squares. Thus, we obtain the following corollary 
to the proof: 


=m COROLLARY 4.1 / Let N be an odd integer that can be written as a sum 
of two squares in two distinct ways: 
N =a* +b? =c* +d? (aandc even) 


Then N is composite and can be factored as 


(BG 


D=(a—c,d-—b) s= 


where 


_d+b 
7 S 


u 


In summary, if N is a positive integer of form 4k + 1, then 

1. If N cannot be written as a sum of two squares, N is composite, with an 
even number of prime factors of form 4k + 3, at least two of which are dis- 
tinct. 

2. If N can be written as a sum of two squares that are not relatively prime, 
N is composite and has the greatest common divisor of the two squares as a 
factor. 

3. If N can be written as a sum of two relatively prime squares in just one 
way, N is a prime number. 

4. If N can be written as a sum of two squares in two distinct ways, Corol- 
lary 4.1 can be used to factor N. 

To apply the method, we observe that if N is a sum of two squares, one of 
the squares must be less than or equal to N/2. Thus, we must check whether 


N — a’ is a square for a = 1, 2, ---, [,/N/2]. The work is expedited if we note 
that 
N —(a— 1)? =(N — a’) + Qa — 1) 


N —(a— 2)? =[N — (a— 1)7] + Qa — 3) 
N —(a— 3)? =[N — (a —2)"] + a — 5) 


Example 2. 


Use Euler’s method to factor N = 493, 
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SOLUTION: Since WN is of form 4k + 1, the method can be used. We must check 
N — a? for a= 1, 2, «++, [VN/2] = 15. 


a N — a? 

15 268 
2:-15—1= 29 
14 297 
2°15—3= 27 

13 324 = 18? 
2°15—S= 25 
12 349 
23 
11 372 
21 
10 393 
19 
9 412 
17 
8 429 
15 
7 444 
13 
6 457 
11 
5 468 
9 
4 477 
7 

3 484 = 22? 


Since N = 132 + 18? = 3? + 227, then 
D= (13 — 3, 22 — 18) =2 
13—3 

2 
ee cose 
2 
pee 
5 


a ae =e) 5? + 27) =17- 29 
3=[(5) +) Jo+29- 


s= 5 


y 


8 
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EXERCISE 


1. Use Euler’s method to decide if the following numbers are primes. If 
possible, use Corollary 4.1 to factor the numbers. 

(a) 1357; 

(b) 1423. 


QUADRATIC 
RESIDUES 


9.1 / INTRODUCTION 


Some general theorems about polynomial congruences were proved in 
Chapters 3 and 4. In this chapter we will consider the simplest nonlinear 
polynomial congruence—the quadratic congruence 


x? = a (mod m) (9.1) 


where a and m are relatively prime. 
In most cases it is sufficient to study this congruence rather than the more 
general quadratic congruence 


Apx? + box + Co = 0 (mod m) (9.2) 


for if ay is relatively prime to m and bg is even (or m is odd), then the congru- 
ence (9.2) can be easily reduced to a congruence of type (9.1) (see Problems 3 
and 4). The problem of solving (9.1) can be further reduced, because by 
Theorem 7 of Chapter 3, solving, 


x? = a (mod p,*!p2** «+ p,**) 
is equivalent to solving the system of congruences 
x? = a (mod p,”') 


x? = a (mod p,””) 


x* = a(mod p,”*) 
As we shall see later solving 
x? = a(mod p*) 
can be accomplished easily if we can solve 


x? =a (mod p) 
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Most of this chapter, therefore, will be concerned with the congruence 
x? = a (mod p) 
where p is an odd prime and a # 0 (mod p). 


m DEFINITION / Let a be relatively prime to m. We say that a is a quadratic 
residue (mod m) if the congruence 


x* = a(mod m) 
has a solution. If the congruence has no solution, we say that a is a quadratic 
nonresidue (mod m). 


Example I. 


By considering the squares of 17, 27, ---, 107 (mod 11), we see that 


1? = 10? = 1 (mod 11) 


2?7= 9? =4(mod 11) 
37 = 87 =9 (mod 11) 
42= 7? =5(mod 11) 


5*= 6* =3(mod 11) 


Thus, the quadratic residues (mod 11) are 1, 3, 4, 5, and 9, and the quadratic 
nonresidues are 2, 6, 7, 8, and 10. 


Example 2. 


By considering the squares of 1, 5, 7, and 11, we see that, modulo 12, the only 
quadratic residue is 1, while the quadratic nonresidues are 5, 7, and 11. 


EXERCISE 
1. List the quadratic residues and quadratic nonresidues (mod m) for: 
(a) m=17; 
(6b) m= 18; 
(c) m=19. 


PROBLEMS 


1. Prove: If a and b are quadratic residues (mod m), then ab is also a quad- 
ratic residue (mod m). 
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2. Prove: If ais a quadratic residue and b is a quadratic nonresidue (mod m), 
then ab is a quadratic nonresidue (mod m). 


3. Show that solving the quadratic congruence 
x? + 2bx + c = 0 (mod m) 
is equivalent to solving 
y? = b? —c (mod m) 


4. Let m =p,"' p,” --+ p,’* (distinct odd primes). Let a be relatively prime to 
m. Show that solving 


ax? + bx +c =0(mod m) 
is equivalent to solving a system of congruences 
y? = A (mod p,"*) 
y’ = A (mod p,”) 


y” = A(mod p,”**) 
[Hint: See Problem 3.] 
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It is not difficult to prove that the product of two quadratic residues is a 
quadratic residue and that the product of a quadratic residue and a non- 
residue is a nonresidue (Problems 1 and 2 of Section 9.1). If we study the 
examples in Section 9.1, we see that the situation is more complicated when we 
have the product of two quadratic nonresidues. Modulo 11, the quadratic 
nonresidues are 2, 6, 7, 8, and 10, and the product of any two numbers in this 
set is a quadratic residue. Modulo 12, the quadratic nonresidues are 5, 7, and 
11 and the product of any two distinct nonresidues is a nonresidue. If we 
considered a few more examples, we would be led to conjecture that the 
product of two quadratic nonresidues is a quadratic residue, provided the 
modulus is a prime number. 

Euler discovered a criterion for deciding when a given number is a quadratic 
residue, modulo an odd prime number. This criterion provides us with the 
simplest proof of the quadratic character of the product of two integers. 


= THEOREM 1 / (Euler’s Criterion) If a is relatively prime to the odd 
prime, p, then a is a quadratic residue or a quadratic nonresidue (mod p), 
according to whether a‘? ~ 1/2 = 1 (mod p) or a? ~ 2/2 = —1 (mod p). 
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PROOF: Using Fermat’s theorem we find that 
(a? ~ 1/2 _ 1yqg?~ Y/2 41) = a?~ 1 — 1 = 0(mod p) 
and thus 
a?~'2=1 or  a?~ Y/? = —] (mod p) 


If a is a quadratic residue (mod p), there exists an integer xy such that x)” =a 
(mod p). By Fermat’s theorem, 


qi’? — WI? = (x57)? ~ 1/2 = x4? 1 = 1 (mod p) 
To prove the converse, we assume that a? ~')/* = 1 (mod p). Ifg is a primitive 


root (mod p), there exists a positive integer ¢ such that 
g’ = a (mod p) 
Then 
gt @~ I2 = g(P~ YI2 = | (mod p) 
which (by Theorem 8 of Chapter 4) implies that 


i 
t- 2 = 0(mod p ~ 1) 


Thus, ¢ is even, and so 
(g'*)? = g' = a (mod p) 


which implies that a is a quadratic residue (mod p). 


mw COROLLARY 1.1 / Let p be a prime number, let a and D be relatively 
prime to p. 

(a) If a and 5 are quadratic residues (mod p), so is ab. 

(b) If a and b are quadratic nonresidues, ab is a quadratic residue. 

(c) If a is a quadratic residue and b is a quadratic nonresidue, then ab is a 
quadratic nonresidue. 


PROOF OF (bd): Since a and bare relatively prime to p, then so is ab. If p = 2, 
there are no quadratic nonresidues. Thus, we need only consider the case in 
which p is an odd prime. Since a and b are quadratic nonresidues, then by 
Euler’s criterion 


(ab)? ~ D2 = gq? ~ V/2p ~ 1/2 = (—1)(—1) = 1 (mod p) 


Therefore, ab is a quadratic residue (mod p). 


Much of the work in quadratic residues is simplified if we use the following 
notation, due to A. M. Legendre (1752-1833). 
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= DEFINITION / Let a be relatively prime to the prime number p. The 
Legendre symbol (ap) is defined to be the number | if ais a quadratic residue 
(mod p), and the number — 1 if ais a quadratic nonresidue (mod p). 


We can now summarize the results of Theorem 1 and Corollary 1.1 as 
follows: 


=» COROLLARY 1.2 / Let a and b be relatively prime to the prime number 
p. Then 
(a) a= b(mod p) implies that (al p) = (b| p); 


(6) (ab| p) = (al p) (lp); 
(c) if p is greater than 2, then (a| p) = a’? ~ 1/* (mod p). 


The following definition will be needed for the problems. 


= DEFINITION / We say that the integers a and b are associates, modulo m, 
if ab = | (mod m). 


PROBLEMS 


1. Let a be relatively prime to the prime number p. Prove that a has an asso- 
ciate (mod p) and that this associate of a is not equal to a unless a= +1 
(mod p). 


2. (Wilson’s theorem). If p is an odd prime number, then 
(p — 1)! = —1 (mod p) 
[Hint: Group the numbers |, 2, ---, p — 1 in pairs that are associates. ] 


3. (Generalized version of Wilson’s theorem.) If p is an odd prime number, 
then 


(Hint: In the product (p — 1)!, note that 


p-1i _ p-1 


aan, ee | ae 
5 +2= (= 1) (mod p) 


and so on.]| 


4. Use Problem 3 to prove that if p is of form 4n + 1, then —1 is a quadratic 
residue (mod p). 
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9.3 / ELEMENTARY REMARKS ON THE DISTRIBUTION 
OF QUADRATIC RESIDUES 


There are a number of interesting problems concerning the distribution of 
quadratic residues, modulo the odd prime number p. In this section we will 
consider the following questions: 

1. Are the number 1, 2, 3,---,»— 1 equally divided among quadratic 
residues and nonresidues? 

2. Can we obtain an upper bound (which depends only on p) for the size of 
the smallest quadratic nonresidue? 

3. Is there an upper bound for the maximum length of a sequence of con- 
secutive quadratic residues ? 

4. How does the quadratic character of a affect the quadratic character of 
p-a? 

We will answer questions | and 4. Bounds for 2 and 3 are considered in the 
problem section. 


=m THEOREM 2/ Letabe relatively prime to the odd prime number p. If p is 
of form 4n + 1, then —a is a quadratic residue (mod p) if, and only if, ais a 
quadratic residue. If p is of form 4n + 3, then —a is a quadratic nonresidue 
(mod p) if, and only if, a is a quadratic residue. 


PROOF: From Theorem | we see that 
(—alp) = (—a)? > 9? & (-1)® > DM g@~ DI? = (— 1) Y(alp)_ (mod p). 
Since the quantities (—a|p) and (—1)"~}/? (a|p) are equal to either 
+1 or —1, this congruence implies that 

(—alp) = (-1)?~ ?*(a[p) 
If p is of form 4n + 1, then (p—1)/2 is even, and so 
(—alp) = (alp) 

If p is of form 4n + 3, then (p — 1)/2 is odd, and so 


(—alp) = — (alp) 


m COROLLARY 2.1 /If p is of form 4n+1, then —1 is a quadratic 
residue (mod p). If p is of form 4n + 3, then —1 is a quadratic nonresidue 
(mod p). 

As a simple consequence of Theorem 2 we find that if p is of form 4n + 3, 
then half of the numbers 

i Dac ped 

are quadratic residues and half are quadratic nonresidues. We will now prove 
that this is always the case. Obviously, this is equivalent to proving that 


> Glp)=0. 
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= THEOREM 3/ If p is an odd prime number then 
p-1 
2, (alp) =0 


ProoF: Let g be a primitive root (mod p). From Theorem 11 of Chapter 4 
we know that g’ ranges over the numbers 1, 2, ---, p — 1 (mod p), in some 
order, as t ranges over the numbers 1, 2, ---, p— 1. Because g belongs to the 
exponent p — | (mod p) then 


g'?~ /2 = —1 (mod p) 
Thus, if a = g' (mod p), then 
(alp) = (g' |p) = (9)? P? = (gf = (— 1 (mod p) 
and so (alp) = (g'lp) = (- 17 


If we now sum the Legendre symbols, we obtain 
p-1 p-i1 pol 
2X Glpy= 2 Gilpy= 2 (-lf=0 


Example. 


Which of the numbers, 1, 2, ---, 18 are quadratic residues, and which are non- 
residues (mod 19)? 


SOLUTION: Write the numbers in the following pattern: 


12 3 4 5 6 7 8 
~- + 4+ -- - = + 
18 17 16 15 14 13 12 11 10 


Because 1, 4, 9, and 16 are squares, they are obviously quadratic residues. By 
Theorem 2 the numbers 18, 15, 10, and 3 are quadratic nonresidues. Because 3 and 
18 are nonresidues and 3 - 6 = 18, Corollary 1.1 implies that 6 is a quadratic residue 
(and thus 13 is a nonresidue). Because 3 is a nonresidue and 6 is a residue, then 2 isa 
nonresidue (and so 17 is a residue). Continuing in this manner, we find that the 
quadratic residues are 1, 4, 5, 6, 7, 9, 11, 16, 17, and the quadratic nonresidues are 
2, 3, 8, 10, 12, 13, 14, 15, and 18. 


EXERCISES 


1. Find the quadratic residues (mod 43). 


2. Find the quadratic residues (mod 41). [Hint: Find 20 quadratic residues. 
The other numbers must be nonresidues. ] 
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PROBLEMS 


In the following problems p is an odd prime number. 
1. If g is the smallest positive quadratic nonresidue (mod p), show that gq is a 
prime number. 


2. If g is the smallest positive quadratic nonresidue (mod p), prove that 


q<v Pp + 1. [Hint: Show that the numbers gq, 2g, ---,(q — 1)qg, are all less 
than p.] 


3. Prove that there exist two consecutive quadratic residues and two consecu- 
tive quadratic nonresidues if p > 7. 


4. Let m be greater than 1. If m, m+ 1,---,-m+k—1 all have the same 
quadratic character, and if m — 1 does not have the same quadratic character 
as m, show that k < m. 


5. Let k be a quadratic nonresidue which is greater than J D. Prove that no 
sequence of k consecutive quadratic residues or k consecutive quadratic non- 
residues exists. 


9.4/ THE LEMMA OF GAUSS 


Two basic questions concerning quadratic residues are: 

1. Given a prime number p, which numbers are quadratic residues and which 
are quadratic nonresidues? 

2. Given an integer a, for which primes is a a quadratic residue and for 
which primes is a a quadratic nonresidue? 

The second question is obviously more difficult than the first, since it 
involves an infinite set of prime numbers, rather than a finite set of residue 
classes. The preceding material in this chapter has been along the lines indi- 
cated by the first question. We now turn our attention to the second. 

Gauss devised a criterion for determining when a number is a quadratic 
residue (mod p), which can sometimes be used to answer the second question. 


=m THEOREM 4/ (The Lemma of Gauss) Let a be relatively prime to the 
odd prime number p. Reduce each of the numbers a, 2a, ---,(p— 1)/2-a 
(mod p) to a number between | and p — 1, inclusive. Let u be the number of 
these elements that exceed p/2. Then (a| p) = (— 1)”. 


PROOF: Let 5,, 53, :+*, 5, be the numbers that exceed p/2, and let t,, t,, ---, t, 
be the remaining numbers. Obviously, these numbers are distinct and so 


p-! 
u+v=—— 
= 2 
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We will now establish that p — s; is not one of the t’s (i = 1, 2, ---, uw). Suppose 
that 


p-s,=t; 


There exist integers k; and k; between 1 and (p — 1)/2, inclusive, such that 
ak; = s,and ak; = t;(mod p). Thus, 


| 
-_™ 


p — ak, = p — 8s, = t; = ak, (mod p) 
which implies that 
a(k; + k;) = 0 (mod p) 

Since (a, p) = 1, then 

k, + k; = 0 (mod p) 
which is impossible, due to the fact that 

O<k;+k;<p-—Il 
Thus, the numbers 

P — S14, P — Sa, °°") D — Sy Uy, ta, os by 


are the numbers 1, 2, ---, (p — 1)/2 in some order. Taking the product of these 
numbers we obtain 


(Pp — 8i)(P — 52) (D—S,) ty tp = (2 


(= ssa) = (5) th te (PS) f (mod p) 


(—1)"s;S2 °° S,ty + ty = 2) (mod p) 


Because 51, 53, °°, Sy fy, °°*, 4, are congruent (mod p), in some order, to the 
numbers 
p-1 
a, 2a, °°, 5 “a 
then 


(—1)*a? ~ n2(P— "I =(—1)"a-2a-3a--- P= 


= (—1)"sy52 ++ Sytyty +++ t, 


IH 
area 
sS 
st 
— 
~~ 
°—— 
oN 
=| 
i) 
Q. 
Ss 
— 
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If we now cancel (p — 1)/2 !, we obtain 
(—1)"a? ~ 2/2 = 1 (mod p) 
and so (—1)" = a” ~ Y/? (mod p) 
Using Euler’s criterion, this implies that 
(a|p) = a? ~ 9? = (—1)" (mod p) 
and since p > 2, that 


(alp) = (— 1)" 


As an application we determine which primes have 2 as a quadratic residue. 


= THEOREM 5 / If the prime number p is of form 8” + 1, then 2 is a quad- 
ratic residue (mod p). If pis of form 8n + 3, then 2 is a quadratic nonresidue 
(mod p). 


PROOF: To apply the lemma of Gauss, we must consider the numbers 
p-1i 
2, 4, 6, +--+; ——— 2 
2 


Since these numbers are in the proper range, we must count the number that 
exceed p/2. Noting that 2k < p/2 if, and only if, k < p/4, we see that [p/4] of 
these numbers are less than p/2. Thus, the number of those that exceed p/2 is 


Case I. If p = 8n + 1, then [p/4] = 2n. Thus, 
(2|p) = (— I)" = (— DP 7 - IAT = (1) = I 
The other three cases are left for the reader (Exercise 1). 
We now use Theorem 4 to prove another criterion for quadratic residues. 


The main use of this result is as a lemma for the quadratic reciprocity law, to 
be proved in Section 9.5. 


=m THEOREM 6/ If a is an odd positive integer, relatively prime to the odd 


prime p, then 
(p-1)/2 


2 [ja/p] 
(alp) =(—1) ° 


PROOF: We will use the same notation as in Theorem 4. We divide each 
term ja (j = 1, 2, ---,(p — 1)/2) by p, obtaining 


jJa=qptr, O<r<p 
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The quotient qg is equal to [ ja/p] and the remainder r is one of the numbers s,, 
So5°*'s Syy ty, °+*; t,. Thus 


(p — are (p - a ja u v 
XJ aE \+ ~syt Dt (9.3) 
j=1 j= j=l 
In the proof of Theorem 4 we established that the numbers 

P — S15 D — S25 °**s D — Sys £4, °°', Ey 
are just the numbers 1, 2, ---, (p — 1)/2, in some order. Therefore, 


(p — 1)/2 


» j= Yes) + Lua Yat Ve (9.4) 


j=1 


Subtracting (9.4) from (9.3), we obtain 


(p - 1)/2 (p - ae ja u 
(a — 1) Py j= fy LS z|-+4+ cess (9.5) 


If we now reduce (9.5) modulo 2, recalling that a and p are congruent to 1, 
we obtain 


(P = 1/20 jg 
0= i" a, E “)-4 + 0 (mod 2) 
= a 
and so . x LEl= = u (mod 2) 


Applying Theorem 4, we see that 


(p~1)/2_. 
[ja/p] 


(alp) =(-1)"=(-1) * 
EXERCISES 


1. Complete the proof of Theorem 5. 
2. Prove that (2| p) = (—1)0"7 2/8 =(— 1) 0/4] 


PROBLEM 


1. Use Theorem 4 to prove that 3 is a quadratic residue, modulo the odd prime 
p, if, and only if, p = + 1 (mod 12). 
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9.5 / THE QUADRATIC RECIPROCITY LAW 


Let p and q be odd prime numbers. It is natural to inquire whether the 
quadratic character of p (mod q) affects that of gq (mod p). On empirical 
grounds, Euler stated that there is a connection between them. Two years later, 
Legendre stated his famous quadratic reciprocity law: 


(plg)(q|p) = (—1)@ 7 2 @- eR 


Legendre was able to prove his conjecture for certain cases, but was not able to 
extend his proof. At the age of 18, with no knowledge of the work of his pre- 
decessors, Gauss conjectured that the quadratic reciprocity law holds, and one 
year later was able to prove it. Gauss felt a personal attachment to this 
theorem and during his lifetime he was able to prove it in seven different ways. 
The proof we shall give is due to Eisenstein, one of Gauss’ students, and is 
based on a geometrical interpretation of Theorem 6. 
Using Theorem 6, we see that 


(p ~1)/2 (q -_1)/2 
ES Caley Ek psa 


(plq)-(qlp) =(-1) 7 >" (—1) “ 


(Pp (q 


— 1)/2 
Ee) Ualpl +E Ukp/ad 


=(-1) 7°" 
Thus, if we can establish that 


(p — y [2]+° x, [F]-2 dines r 1 — | (9 6) 
j=i1 k=1 


the reciprocity law will be proved. To prove (9.6) we count the lattice points in 
a portion of the X Y-plane in two different ways. 


= THEOREM 7 / (The Quadratic Reciprocity Law) If p and q are distinct 
odd prime numbers, then 


(playalp) = (— DO PD. 


PROOF: Consider the rectangle in the X Y-plane bounded by the lines x = 0, 
x = p/2, y =0, and y =q/2. We bisect this rectangle by the line y = gx/p (see 
Figure 9.1). Let 7, be the portion of the rectangle below the line and T,, be the 
portion of the rectangle above the line. 

Because p and q are prime numbers, none of the lattice points interior to the 
rectangle lies on the line y = qx/p. Hence, the lattice points all lie in 7; or in 
T,. To count the lattice points interior to T,, we observe that if 1 <j < p/2, 
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the number of lattice points in 7, that are directly above the point (j, 0) is 
[jq/p]. Thus, the number of lattice points in 7; is 


(p — y [=] 
j= a 
In a similar way, if 1 < k <q/2, the number of lattice points in 7,, directly to 
the right of the point (0, k), is [kp/q], and so T, contains 


(q - y [>] 
j= 2, 
lattice points. Thus, the number of lattice points in the rectangle is 


(p-—1)/2 7; q-1/2rk 
la 
j=l p k=1 q 


G/2 


p/2. 


Figure 9.1 p= 13,¢q= 11. 


It is obvious that the rectangle contains a total of 


p-\1q-\1 
2 2 
lattice points. It follows that 


(p ae 
“tiqlpy + 7s tkpial 


(planqlp) =(-1) 7 a a(S rem 
m COROLLARY 7.1 / Let p and q be distinct odd prime numbers. If both p 
and q are of form 4n + 3, then p is a quadratic residue (mod q) if, and only if, 
g is a quadratic nonresidue (mod p). If at least one of the primes is of form 
4n + 1, then p is a quadratic residue (mod q) if, and only if, g is a quadratic 
residue (mod p). 
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Example I. 
Show that 43 is a quadratic nonresidue (mod 1093). 


SOLUTION: Since 1093 is a prime of form 4n + 1, then 
(43/1093) = (1093/43) 
Since 1093 = 18 (mod 43), then 
(1093/43) = (18|43) = (2|43) (9/43) 
From Theorem 5 we see that (2|43) = —1, and, since 9 = 37, (9/43) = +1. 
Thus, 
(43|1093) = (2/43) (9/43) = (—1) (41) = —1 


and so 43 is a quadratic nonresidue (mod 1093). 
Example 2. 
Find all odd prime numbers for which 3 is a quadratic residue. 


SOLUTION: Since 3 is of form 4n + 3 we must consider two cases. 

Case I. Let p be a prime of form 4n+ 3. Then (3|p) = —(p|3), and so 3 is a 
quadratic residue (mod p) if, and only if, p is a quadratic nonresidue (mod 3); that 
is, if, and only if, p = 2 (mod 3). This requirement, together with the condition 
p = 3 (mod 4), implies that 3 is a quadratic residue (mod p) if, and only if, p = 11 
(mod 12) (by the Chinese remainder theorem). 

Case II. Suppose p is a prime of form 4n+ 1. Then (3|p) = (p]3), and 3 is a 
quadratic residue (mod p) if, and only if, p = 1 (mod 3). The two restrictions imply 
that 3 is a quadratic residue (mod p) if, and only if, p = 1 (mod 12). 

Combining the results of the two cases, we see that 3 is a quadratic residue, 
modulo the odd prime p, if, and only if, p = +1 (mod 12). 


Example 3. 
Find all odd prime numbers for which —3 is a quadratic residue. 


SOLUTION: Case I. Let p be a prime of form 4n + 1. Then 
(—3|p) = (—1|p): Blip) = GIP) 


Therefore, by Example 2, '—3 is a quadratic residue (mod p) if, and only if, p = +1 
(mod 12). Since p = 1(mod 4), this implies that p must be congruent to 1 (mod 12). 
Case II. Let p be a prime of form 4n + 3. Then —3 is a quadratic residue (mod p) 
if, and only if, 3 is a quadratic nonresidue (mod p), which, by Example 2, implies 
that p = +5 (mod 12). Since p = 3 (mod 4), then p must be congruent to 7 (mod 12). 
Combining the results of the two cases we find that —3 is a quadratic residue, 
modulo the odd prime p, if, and only if, 


p = 1 (mod 6) 
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EXERCISES 


1. Use the quadratic reciprocity law to evaluate the Legendre symbols: 

(a) (61]1217); 

(b) (67|1987); 

(c) (67|1997). 
2. Prove that if (x, y) is a lattice point interior to the rectangle described in the 
proof of Theorem 7, then (x, y) does not lie on the line y = qx/p. 


3. Find all odd primes p for which 2 and 3 are quadratic residues. 


PROBLEM 


1. Determine all odd prime numbers p for which q is a quadratic residue 
forg =5, 7, 11. 


9.6* / APPLICATION TO PRIMES IN ARITHMETIC 
PROGRESSION 


In Section 8.2 we established that the arithmetic progressions {4n — 1} and 
{6n — 1} contain an infinity of prime numbers, and observed that the proofs of 
the corresponding theorems for {47 + 1} and {67+ 1} are more difficult. 
Using results on quadratic residues we can now dispose of those cases as well. 


= THEOREM 8 / There is an infinity of prime numbers of form 4n + 1. 


PROOF: Assume the set of such primes is finite and let {5, 11, ---, p} be all 
of them. Consider the number 


N=(2:5-1i1- p> +1 


Since N is larger than p and is of form 4” + 1, then N must be composite. 
Furthermore, if g is a prime factor of N, then q is of form 4 + 3. But then 
(2°5-11-+: p)* = —1 (mod q), which, by Theorem 2, is impossible. 


Using Example 3 of Section 9.5, we can prove a similar result for primes of 
form 6n + 1. The proof is left for the reader. 


= THEOREM 9 / There is an infinity of prime numbers of form 6n + 1. 
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9.7* | QUADRATIC RESIDUES FOR A COMPOSITE MODULUS. 
THE JACOBI SYMBOL 


By use of the following theorem we can determine all quadratic residues for 
an odd composite modulus. The proof is left for the reader (Problems 1, 2, 
and 3). The more general case (quadratic residues for an even modulus) is 
considered in Problems 4 and 5. 


= THEOREM 10/ Let q,, qp, ---, g, be distinct odd primes and let 


Q= 4d dn” — ae 
The integer P is a quadratic residue (mod Q) if, and only if, P is a quadratic 
residue (mod q;) (j = 1, 2, -::, 7). 


For example, since (30|7) = 1 and (30/11) = —1, then 30 is a quadratic 
nonresidue (mod 7: 117). 

We now extend the definition of the Legendre symbol to cover the case 
where the modulus is an odd composite number. The most natural way to do 
this is to define (P|Q) = 1 if Pis a quadratic residue (mod Q), and (P|Q) = 
— 1 if P is a quadratic nonresidue. Unfortunately, if we do this, we lose the 
quadratic reciprocity law. This property is of such convenience that we choose 
a different, less natural, way of defining (P|Q), which preserves the quadratic 
reciprocity law. The extended symbol is due to C. G. J. Jacobi (1804-1851). 


m DEFINITION / Factor the odd integer Q as a product of prime numbers: 
O =4142 °°: q, (not necessarily distinct). Let P be relatively prime to Q. The 
Jacobi symbol (P|Q) is defined by 


(P/Q) = (Plai)(Pla2) «+: (Plan) 


where the symbols on the right are Legendre symbols. 


The fact that the Jacobi symbol (P|Q) is equal to + 1 does not imply that P 
is a quadratic residue (mod Q). For example, 


(30|7 - 112) = (30]7) - (30]11) - (30]11) = +1 


even though 30 is a quadratic nonresidue (mod 7 - 117). On the other hand, 
we get the following corollary to Theorem 10. 


=m COROLLARY 10.1 / If P and Q are relatively prime and (P|Q) = —1, 
then P is a quadratic nonresidue (mod Q). 
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Before proving the extended quadratic reciprocity law, we establish some 
simple properties of the Jacobi symbol. In the statement of the theorem we 
assume that the Q’s are odd and the respective P’s are relatively prime to the 


O's. 


=« THEOREM 11 / 
(a) (P|Q,Q>) = (P|Q,) (P|Q>); 
(b) (P,P2|Q) = (Pi|Q) (P2|Q); 
(c) if P, = P, (mod Q), then (P,|Q) = (P2|Q). 


SKETCH OF THE PROOF: (a) follows immediately from the definition of 
the Jacobi symbol. (6) and (c) are consequences of the fact that the Jacobi 
symbol is a product of Legendre symbols, and the respective properties hold 
for Legendre symbols. 


=» LEMMA / If a,, a5, ---, a, are odd integers, then 


a,-1 a,-1 a, —1 _ 4142" 
fe ee a 


ned 2) 


PROOF: If n = 2, then, since (a, — 1) and (a, — 1) are even, 


a,a,—1 (A5- + pA) =e ee 
2 2 a. 2 
—1 —1 
2 ae) Aas ) — 0 (mod 2) 
Thus, 
a,;—-1 a,—-1_ a,a,—-1 
nn ae) 


(mod 2) 


The rest of the proof follows easily by mathematical induction and is left for 
the reader. 


m THEOREM 12/ (The Extended Quadratic Reciprocity Law.) If P and 
Q are relatively prime odd positive integers, each greater than 1, then 


(P|Q)(O|P) = (— 1) 7 2/2 DP? 


PROOF: Let P =p; P2 -*: Pm and QO =4q,q2 --: q, be the factorizations into 
prime numbers. Applying Theorem 11 several times, we obtain 


n 


(pilq;)° I LU Gulp 


CACRICRAD 


(Pi) (@lP)= TI, TT 
wu 


io io 
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From the quadratic reciprocity law for Legendre symbols it follows that 
P P)= 1 OT _ 4 )l(pi = 1)/21ay ~ 19/2] 
(PIQ)(QIP) A de ) 


EES (pi 1)/2+ (gy - 1/2 


= (-—1) 1j=1 
_ (= nt x (ei - nat, pS (49 — 1/2] 
From the lemma we see that 
ai pP-—1 Pi1P2 ** Dn — 1 P 1 
2 2 y) 5 (mod 2) 
n q oe 1 a 1 
d j my Daas re 
j=1 2 D) (mod 2) 
which implies that 


(PIQ\(Q|P) = (— ni Eo orl, Ear vr} 


= (—1)? — 1)/2°-(Q- 1)/2 


With the quadratic reciprocity law we can invert the Legendre symbol when 
both of the numbers involved are primes. The extended quadratic reciprocity 
law allows us to invert them when one of the numbers is composite. This is one 
reason for defining the Jacobi symbol as we did. Its connection with quadratic 
residues is rather incidental. 


Example. 
Show that the congruence x? = 15 (mod 1093) has no solution. 


SOLUTION: The prime number 1093 is of form 4n + 1. Thus, 


(15|1093) = (1093]15) = (—2|15) = (—2|3)(—2|5) 
= (1|3)(3|5) = (+1—)d) = —1 


Because the Legendre symbol (15|1093) is negative, 15 is a quadratic nonresidue 
(mod 1093). 


EXERCISES 


1. Which of the following congruences have solutions? 


(a) x* = 39 (mod 131); 
(b) x? = 78 (mod 131); 
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(c) 2x? + 5x — 7 =0 (mod 79); 
(d) x? = 20 (mod 117) (composite modulus). 


2. By an example, show that if we define the symbol (a/b) equal to +1 or —1 
according to whether a is a quadratic residue or nonresidue, modulo 5, then 
the quadratic reciprocity law does not hold for composite b. 


PROBLEMS 


1. Let m and n be relatively prime positive integers. If a is relatively prime to 
mn, prove that a is a quadratic residue (mod mn) if, and only if, a is a quadratic 
residue (mod m) and (mod n). [Hint: If x)” = a (mod m) and y,.* = a (mod n), 
we wish to find a solution to 


z? = xX,” (mod m) 
z? = y,.” (mod n) 


2. Let p be an odd prime number and let 7 be a positive integer. Prove that a 
is a quadratic residue (mod p”) if, and only if, a is a quadratic residue (mod p). 
[Hint: If x9? =a + mp, show that one of the numbers 


Xo: Xo + Ps Xo + 2p, Xp + 3p, °° 


is a solution of z? = a (mod p’”).] 


3. Prove Theorem 10. 
4. Let k be an odd integer. 
(a) Find all solutions of x* = k(mod 2). 
(b) Find all solutions of x? = k (mod 4). 
(c) If B = 3, prove that 
x? = k (mod 2°) 
has a solution if, and only if, k = 1 (mod 8). 


5. Use Problems 1, 2, and 4 to generalize Theorem 10. 


THE TWO- 
AND FOUR-SQUARE 
PROBLEMS 


. 
; 

i 

$ 

i 

| 

es id 


10.1 / INTRODUCTION 


In his correspondence with Fermat, Bachet mentioned that Diophantus had 
assumed that each positive integer can be written as a sum of four squares. 
Bachet verified this assumption for all integers up to 325, but was unable to 
devise a proof. A few years later, Fermat stated that he had a proof of this 
result, but, since he was more interested in proposing problems for other 
mathematicians than in exhibiting his methods of proof, this proof (if he had 
one) was lost to posterity. 

The difficulty of the problem is evidenced by the fact that even Euler could 
not solve it. He did, however, make significant progress towards a solution. In 
1770, over a hundred years after Fermat’s death, J. L. Lagrange used a result of 
Euler (Lemma 4 of Section 10.3) to prove the assumption made by Diophantus. 

Wecan never know if Fermat had a valid proof. Most authorities believe that 
he did have one, since several of the known proofs of the “four-square theorem”’ 
rely on the “method of descent,”’ one of Fermat’s favorite techniques.’ 

Let us examine the first few positive integers and see how many squares are 
needed to represent them. 


1=1? (one square needed) 
2=17+ 1? (two squares needed) 
3=17+174+1? (three squares needed) 
4=2? (one square needed) 
5 = 27 + 1? (two squares needed) 
62.2" a1 (three squares needed) 
7=27+17+17+ 17 (four squares needed) 
8 = 2? + 2? (two squares needed) 
9 = 3? (one square needed) 
10 = 37 + 1? (two squares needed) 


1The method of descent is a variation on the well-ordering principle. An 
illustration of the technique is in Section 11.3. 
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We see from our small table that the first ten positive integers can be 
written as sums of four squares (counting 07) and that one of them cannot be 
written as a sum of less than four squares. What is more interesting is the fact 
that most of these numbers can be written as sums of two squares. 

Diophantus studied the problem of representing numbers as sums of two 
squares and sums of three squares and gave necessary conditions that these 
representations exist. Unfortunately, the surviving texts are unclear and we do 
not know if he had a correct criterion. 

In the following section we will study the problem of representing positive 
integers as sums of two squares. 


10.22 / THE TWO-SQUARE PROBLEM 


In this section we will establish necessary and sufficient conditions which 
insure that a positive integer can be written as a sum of two squares. 

If we examine the brief table in Section 10.1, we see that (allowing one of the 
squares to be 07) the following positive integers can be written as sums of two 
squares: 1, 2, 4, 5, 8, 9, 10. If we consider the products of these numbers, we 
see that they can also be represented in this manner. If we can prove that this is 
always the case, we will have achieved an important first step in solving the 
two-square problem. 


= THEOREM 1 / If each of a and 5 can be written as a sum of two squares, 
then the product ab can also be written as a sum of two squares. 
SKETCH OF THE PROOF: The proof follows from the algebraic identity: 
(s? + t?)(u? + v?) = (su + tv)? + (sv — tu)? 


The next problem we will consider is that of deciding which prime numbers 
can be written as sums of two squares. Because it is obvious that the prime 
number 2 can be written in this form, we may restrict ourselves to odd primes. 
The following lemma, due to the Norwegian mathematician Axel Thue, 
provides an essential step in the proof. 


a LEMMA / Let p bea prime number and let a be relatively prime to p. The 
congruence 

x = ay (mod p) 
has a solution X9, Yo such that 


0<|xol< Jp and O0<|yl<Jp 
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PROOF: Let m =[,/p]. Consider the set of numbers 


1+a,1+2a,:--,1+ (m+ la 
2+a,2+2a,---,2+(m+ la 


(m+1) +a, (m+ 1) + 2a, ---, (m+ 1) + (m+ Da 


This set contains (m + 1)? numbers (not necessarily distinct). Since (m + 1)? 
> p, at least two of the numbers, say x, + y,a and x, + y,a, must lie in the 
same residue class (mod p), where x, # x2 or y; # yz. Thus, 


(x, — X2) = (2 — y)a (mod p) 


Because one of the terms (x, — x.) and (y, — y,) is not zero, it follows that 
neither is zero. Since x,, X2, ¥,;, Y2 are all greater than zero and less than or 
equal to m + 1, then 


0< |x,-—x,|<m<./p and 0<|y2.-yil<sm<./p 


If we let x» = x, — X, and yp = y2 — y;, the proof is complete. 


=» THEOREM 2/ The odd prime p can be written as a sum of two squares if, 
and only if, — 1 is a quadratic residue (mod p). 


PROOF: (Part I) Suppose p is the sum of two squares, say p = a? + b?. 
Since p is a prime number, then b # 0. This implies that there is an integer c 
such that 


cb = 1 (mod p) 
Thus, (ca)? + (cb)* = (ca)? + 1 = 0 (mod p) 
Since (ca)* = — 1 (mod p) 


then —1 is a quadratic residue (mod p). 


(Part II) If —1 is a quadratic residue (mod p), there exists an integer a such 
that 
a” +1=0 (mod p) (10.1) 


Because the integer a must be relatively prime to p, then by the lemma, there 
exist integers Xp and yo, 


0 < [Xo <,/p, 0< Iyol <,/p 
such that Xo = Yoa (mod p) 
If we now multiply (10.1) by yo2, we obtain 
Yo (a” + 1) = (0a) + Yo’ = Xo + Yo. = 0 (mod p) 
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The restrictions on the size of x) and yg insure us that 
0< x,’ + Yo" <2p 


and so Xo? + yo? =P 


The following theorem now follows easily from Theorem 2 and Corollary 
2.1 of Chapter 9. 


= THEOREM 3 / The odd prime number p can be written as a sum of two 
squares, if, and only if, p is of form 4n + 1. 


It follows from Theorems 1 and 3 that any number of form mk, where k has 
no prime factors of form 4n + 3, is a sum of two squares. We will now prove 
that these are the only numbers that can be represented in this manner. 


= THEOREM 4 / Write the positive integer N in the form N = mk, where 
k does not contain a square factor. Then N can be written as a sum of two 
squares if, and only if, kK has no prime factor of form 4n + 3. 


PROOF: If k has no prime factors of form 4n + 3, it follows from Theorem 1 
that k can be written as a sum of two squares, say k = a” + b?. Therefore, 
mk =(ma)* + (mb), a sum of two squares. 

To prove the converse, suppose that N = a” + b*. Let d=(a, b). Then 
d*|N, so that d?|m’. If we write a = a’d, b = b’d, then 

2 2 
_ = a7- = qa’? + b’? = 0 (mod k) (10.2) 


Let p be an odd prime factor of k. Since a’ and b’ are relatively prime, then one 
of them, say a’, is relatively prime to p. Pick an integer c such that 
ca’ = 1 (mod p) 
From (10.2) we see that 
(ca’)* + (cb’)* = 1 + (cb’)* = 0 (mod p) 


which implies that —1 is a quadratic residue, modulo p. Thus, p is of form 
4n+1. 


EXERCISES 


1. Express all of the primes of form 4n + 1 between 100 and 150 as sums of 
two squares. 
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2. Which of the following can be written as a sum of two squares? 

(a) 2+34-5-7?; 

(b) 819. 
3. Let N = m?k = a* + b*, where k does not have a square factor. Let d= 
(a, b). Prove that d?|m?. 
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We have determined which numbers can be written as sums of two squares. 
Certain numbers (for example, 3 and 11) can be written as sums of three 
squares, while others (such as 7 and 23) require four squares. In this section 
we will prove the four-square theorem. In the problem section the three- 
square problem will be considered briefly. 


=» LEMMA 1 / If aand b can be written as sums of four squares, so can the 
product ab. 


SKETCH OF THE PROOF: The proof follows from the following algebraic 
identity, first noticed by Euler: 


(X17 + X_7 +37 + X47? + 2? + Y37 + V4") 
= (X1V1 + X2V2 + X33 + X44)? + (Hp V2 — XV + X3V4 — X43)? 
+ (X13 — X3V1 + X4V2 — X24)? + (HK V4 — X41 + X23 — X3)2)? 


It follows by mathematical induction that if a,, a2, ---, a, can be written as 
sums of four squares, then the product a,a, --- a, can be written as a sum of 
four squares. Thus, it is sufficient to prove that each prime number can be 
written in this manner. 

In the following lemma we consider a proposition for ordered pairs of 
integers that is similar to a property of residue classes. A matter of notation: 
Wesay that the ordered pairs of integers (a, b) and (c, d) are congruent (mod m) 
ifa=cand b=d (mod m). 


s LEMMA 2/ A set of ordered pairs of integers, containing more than m7” 
elements, must have two elements that are congruent (mod m). 


PROOF: Each ordered pair of integers is congruent (mod m) to one of the 
following m7 ordered pairs: 


(1, 1), C1, 2), ---, (, m) 
(2, 1), (2, 2), ee (2, m) 


(m, 1), (m, 2), ---, Gm, m) 


158 « TWO- AND FOUR-SQUARE PROBLEMS 


If a set contains more than m? ordered pairs, at least two of them must be 
congruent (mod m) to one of the ordered pairs in the above list. Obviously, 
these two ordered pairs are congruent (mod m). 


= LEMMA 3/ Leta, b, c, and d be given integers, let p be a prime number. 
The system of congruences 


ax + by — z=0 (mod p) 
cx + dy —u=0 (mod p) 


has a nontrivial solution X9, Yo, Zo, Uo such that each number in the solution is 
less than ,/p in absolute value. 


PROOF: Let x, y, z, and u be independent variables. Define a and B as 
functions of x, y, z, u by 


a= a(x, y, Z,u)=ax+by—z 
B = B(x, y, z,u) =cx+dy—u 


As x, y, z, and u vary over the domain {0, 1, ---,m=[,/p]}, we obtain 
(m + 1)* values of « (not necessarily distinct) and (m + 1)* corresponding 
values of B. Thus, we have (m + 1)* ordered pairs («, B) with « and £ corre- 
sponding to the same values of x, y, z, u. Since (m + 1)* > p?, it follows from 
Lemma 2 that at least two ordered pairs, (a,, B,) and (2, B,), have correspond- 
ing components that are congruent (mod p). Thus, 


a, = ax, + by, — 2, =a, = ax, + by, — z, (mod p) 
and 
B, = CX, + dy, = uj; = B, = CX 2 + dy, — uz (mod p) 


which implies that 


a(x, — X2) + 6, — y2) — (2 — 22) = 0 (mod p) 
and 
C(x, — X2) + Ay, — ¥2) — (Uy, — uz) = 0 (mod p) 


Since (a,, B,) and (a2, B,) correspond to different values of x, y, z, and u, then 
at least one of the numbers x, — x2, y; — 2, Z; — Zz, Uy — Uz iS not zero. If 
we recall the range of values for x, y, z, and u, we see that x, — x, y, — y2, 
Z, — Zz, and u, — uz and all less than ,/p in absolute value. If we now let 
Xp =X1— Xa. Vo =V1 — Va 2% = 2% —2Z2, and uy =U, — Uz, the proof is 
complete. 


The following lemma is the one proved by Euler and later used by Lagrange 
in his proof of the four-square theorem. 
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=» LEMMA 4/ If pis an odd prime number, there exist integers a and b such 
that 
a? + b? = —1 (mod p) 


PROOF: Consider the following two sets: 


A= (0? 12. 22. ... (>=) | 
9 9 > > ) 

—4\2 
B= {—o7 af, a, eee 1, =9* 1,--,— (2) =e | 


Obviously, the elements of A are incongruent (mod p) as are the elements of B. 
Since the union of A and B contains more than p elements, at least two 
numbers in the union are congruent (mod p). One of these numbers, say a’, 
must be in A and the other, say —b” — 1, in B. Thus, 


a? + b? = —1 (mod p) 


=» THEOREM 5/ Each prime number can be written as a sum of four 
squares. 


PROOF: We have verified that the theorem is correct if p = 2 or p = 3. Thus, 
we may restrict ourselves to proving the theorem for primes greater than 3. By 
Lemma 4, there exist integers a and b such that 


a? + b? = —1 (mod p) 

As a special case of Lemma 3 we find a nontrivial solution Xo, Yo, Zo, Uo of 

ax + by =z (mod p) 

bx — ay = u (mod p) 
such that each of the numbers is less than Jp in absolute value. Thus, 

Zo” + Up” = (aX + hyo)” + (bx — yo)” 

(a? + b*)(x9” + Yo) 
—(Xo” + Yo") (mod p) 


and so 
X97 + Yo” + Zo + Up” = 0 (mod p) 


Using the restrictions on the size of x9, Yo, Zo, Uo, we find that 
0<X9? + Yo? +207 + Uo” < 4p 
and so Xo? + Yo? + Zo? + Uy” 


is one of the numbers p, 2p, or 3p. 
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Case I. If x9? + yo? + 27 + Uo” =p, we are through. 
Case II. If Xo? + yo” + Zo” + Up” = 2p, then two of the four numbers, say 
Xo and yo, must be even and the other two odd. If we now expand 


(2 as re) 4 (= = re) a: (- a Ho) fe (72 zi He) 
2 Z 2 2 
we find that this number is equal to p, and so p is a sum of four squares. 
Case III. If x9? + yp? + Zo” + Up? = 3p, then exactly one of the numbers, 


Say Xo, iS divisible by 3. If we allow negative values for some of the other 
numbers, we may assume that yp = Zp = Up (mod 3). If we now expand 


Yo + Zo + Uo\7 Xo + Zo — Uo\? Xo — Vo + Uo\” 
eg ae 


Note Vo way" 
+(2 Yo ) 


3 


we find it is equal to p. 
Thus, in all three cases, p is a sum of four squares. 


=» COROLLARY 5.1 / Each positive integer can be written as a sum of four 
squares. 


EXERCISES 


1. Write the following numbers as sums of four squares. Which of these 
numbers can be written as sums of three squares? 

(a) 31; 

(6) 47; 

(c) 94. 


2. Prove that in Case II of the proof of Theorem 5, two of the numbers Xp, yo, 
Zo Uo are even and two are odd. Prove that in Case III exactly one of the 
numbers 1s divisible by 3. 


PROBLEMS 


1. Prove that there is an infinity of positive integers that cannot be written as 
sums of three squares. [Hint: Show that no integer of form 8k + 7 can be 
written as a sum of three squares. | 


2. Prove that no integer of form 4"°(8k + 7), n => 0, can be written as a sum of 
three squares. [It can be proved that these are the only integers that cannot be 
written as sums of three squares, but the proof is beyond the scope of this book. ] 
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10.4* / REMARKS ON WARING’S PROBLEM 


About the time that Lagrange proved the four-square theorem, the English 
mathematician Edward Waring stated that each positive integer can be 
written as a sum of nine cubes, nineteen fourth powers, “‘and soon.”’ Waring 
probably arrived at this conjecture by studying tables; it is unlikely that he 
could prove it for any of the cases involved. 

In the nineteenth century, a number of mathematicians worked on this 
problem. They proved that for k = 3, 4, 5, 6, 7, 8, and 10, each positive 
integer can be written as a sum of a fixed number of kth powers. The major 
advance in this direction was made by the great German analyst David 
Hilbert, who, in 1906, proved: 

If k is a positive integer, there is a number g(k) such that each positive 
integer can be written as a sum of g(k) kth powers. 

Unfortunately, Hilbert’s proof was not constructive. It established the exist- 
ence of g(k), but could not be used to determine the value of this number. 
[From Corollary 5.1 we know that g(2) = 4.] Since Hilbert’s time many out- 
standing mathematicians have worked on this problem.” The exact value of 
g(k) is now known for all values of k except k = 4 and k =5. The known 
results for small values of k are: g(2) = 4, g(3) = 9, 19 < g(4) < 35, 37 < g(5) 
< 54, 9(6) = 73. 

If we examine tables for the representations of integers as sums of cubes, we 
find that the first integer requiring nine cubes in 23 and the next is 239. The 
surprising fact is that these are the only integers that are known to require 
nine cubes. Every other integer that has been checked has been written as a 
sum of eight cubes. If we pursue this line of investigation further, we find that 
apparently only fifteen other numbers require as many as eight cubes, the 
largest being 454. 

With this in mind, it is natural to define G(x) to be the minimum number of 
kth powers needed for the representation of all sufficiently large numbers; 
that is, all numbers above some bound, which of course, also depends on k. 

In Problem 2 of Section 10.3 it was established that there is an infinity of 
numbers that cannot be written as sums of three squares. Thus, G(2) = g(2) 
= 4, 

In general, we would expect to find that G(k) is smaller than g(k). The 
heuristic argument is as follows: We know that all positive integers can be 
written as sums of g(k) kth powers. For the small positive integers there are 


?Many of the important results were obtained by the English analysts 
G. H. Hardy and J. E. Littlewood and by the Russian mathematician 
I. M. Vinogradov. 
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few kth powers available for the representations and the number of different 
representations for a fixed integer is very limited. As we go further in the 
sequence of integers, we have many more kth powers at our disposal and, 
consequently, we would expect to have several representations for each 
sufficiently large integer. Among these representations, it is likely that at 
least one will require fewer than g(k) kth powers. The small integers that 
require g(k) kth powers are thus seen to be oddities—consequences of the 
scarcity of kth powers at the beginning of the sequence of integers. 

The above argument seems to be justified by the known results. It has been 
proved, for instance, that for most values of k, g(k) is larger than k - 2* ~}, 
but that, in general, G(k) is no larger than approximately k - log k. Thus, for 
most values of k, G(k) is much smaller than g(k). 

The exact value of G(k) is only known for k = 2 and k = 4. As mentioned 
above, we have proved that G(2) = 4. In 1939, H. Davenport proved that 
G(4) = 16. It is known that G(3) is one of the numbers 4, 5, 6, or 7. We do not 
know which, but examination of extensive tables indicates that it is probably 
either 4 or 5. 


SOME NONLINEAR 
DIOPHANTINE 
PROBLEMS 


There is no simple technique for the solution of nonlinear diophantine 
equations such as Euler discovered for linear diophantine equations (Section 
2.3). In this chapter we will consider some isolated examples of nonlinear 
diophantine equations, chosen primarily for historical reasons. 


11.1 / THE PYTHAGOREAN PROBLEM 


The Pythagorean identity a? + b* = c? (where a and b are the lengths of 
the sides of a right triangle and c is the length of the hypotenuse) has been 
known since antiquity. The ancient Greeks were very interested in finding right 
triangles such that a, b, and c are positive integers. Such a triple of integers a, 
b, c, is called a Pythagorean triple. The associated triangle is called a 
Pythagorean triangle. Common examples are 3, 4, 5 and 5, 12, 13. 

Two natural questions arise: 


1. Is there an infinity of Pythagorean triples? 


2. Is there a formula that can be used to generate all Pythagorean 
triples? 


To answer the first question we observe that if a, b, c is a Pythagorean triple 
and k is a positive integer, ka, kb, kc is also a Pythagorean triple. Thus, the 
answer is obviously affirmative. For example, we obtain the Pythagorean 
triples 6, 8, 10; 9, 12, 15; 12, 16, 20; and so on, from the triple 3, 4, 5. 

Suppose now that a, b, c is a Pythagorean triple and that d = (a, b). Then 
obviously, d|c?, and so d|c. It follows by a similar argument that 


d = (a, b) =(a, c) = (8, c) 
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If we factor a = a'd, b = b'd, c = c'd, then 
a?+b* c 


d? d 


which means that a’, b’, c’ is a Pythagorean triple in which the numbers are 
pairwise relatively prime. 


q’” ae b’2 ae 


= DEFINITION / A set of pairwise relatively prime positive integers, a, b, c 
is called a primitive Pythagorean triple if a + b? = c?. 


Question 1 becomes more interesting if we rephrase it as: 


1’. Is there an infinity of primitive Pythagorean triples? 


If we can find all primitive Pythagorean triples, we obviously can use them 
to generate all Pythagorean triples. Therefore, in this section we will con- 
centrate on solving the diophantine equation 

x? ok ‘y? _ z2 
in positive integers, subject to the side condition (x, y) = 1 [which, of course, 
implies that (x, z) = (y, z) = 1]. 

In order to keep the proof of the main theorem relatively simple, we will 

establish several of the needed results in a series of lemmas 


=» LEMMA 1/If x, y, z is a primitive Pythagorean triple, then one of the 
numbers x, y is even and one 1s odd. 


PROOF: Because the triple is primitive, x and y are obviously not both even. 
Suppose that both are odd. Then 
x? = y? = 1 (mod 4) 
and so z? =x* + y? =2 (mod 4) 
which is impossible since the square of an even integer is congruent to zero 


(mod 4). Thus, one of the numbers x, y is even and one 1s odd. 


In order to have a standard notation, we will assume in the sequel that x is 
odd and y is even. Obviously, z is odd. 


m LEMMA 2/ If x, y, zis a primitive Pythagorean triple, then the integers 


— aaa = 


are relatively prime. 


11.1 | THE PYTHAGOREAN PROBLEM w@ 165 


PROOF: Any common divisor of 


Z—-xXx Z+xX 
d 
y ia 2 


must divide their sum and difference—the numbers x and z. Since (x, z) = 1, 


this means that 
> | _| 
aan te ie 


=» LEMMA 3/ If a and 5b are relatively prime positive integers having a 
square as their product, then a and 6 are also squares. 


The proof is left for the reader (Exercise 2). 


We are now able to prove that there is a formula (in two parameters) that 
can be used to generate all primitive Pythagorean triples. 


= THEOREM 1 / The positive integers x, y, z constitute a primitive Py- 
thagorean triple if, and only if, there exist two relatively prime positive 
integers s and t, s > t, one even and one odd, such that 


x=s?— 12? 
y = 2st 
z=s* +t? 


PROOF: (Part I) Let x, y, z be a primitive Pythagorean triple. Then 


y? =27— x? =(z+ x) (z— x) 


2 — 
and so (;) Be a (11.1) 


2 2 2 


By Lemma 2 the factors on the right are relatively prime, and by Lemma 3 
each of them is a square, say 


zZ+xX_ 2 Smee 


2 Z 


where s and ¢ are positive integers. By adding and subtracting these quantities, 
we obtain 

x=s*—t?,z= 37 +0? 
From (11.1) we obtain 


(t) <2? t*-s 
2 2 


and so y = 2st 
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Obviously, any common divisor of s and ¢ will divide both x and z, meaning 
that (s, t) = 1. In particular, we see that s and ¢ are not both even. If they are 
both odd, then x, y, and z are all even, which is impossible. Thus, one of the 


numbers s, ¢t is even and the other is odd. 
(Part II) The “ only if” part of the proof is left for the reader (Problem 1). 


=» COROLLARY 1.1 / There is an infinity of primitive Pythagorean triples. 
A few of the primitive Pythagorean triples are listed in Table 11.1. 


Table 11.1 


S t x y Z 
(s? — t”) (2st) (s*4+ 17) 


2 ] 3 4 5 
4 I 15 8 17 
6 ] 35 12 37 
3 2 5 12 13 
5 2 21 20 29 
EXERCISES 


1. Construct a table of all primitive Pythagorean triples with z less than 100. 


2. Prove Lemma 3. 


PROBLEMS 


1. Complete the proof of Theorem 1. 


2. (a) Observe that if s=7¢-+ 1, the conditions of Theorem 1 are satisfied. 
Derive the parametric form of x, y, z in terms of t¢ for this special case. 

(b) Let x,, y,, Z, be the values of x, y, z in part (a). Obtain a recursive formula 
for X, + 1>:+1> 214 11n terms of x,, y,, Z,. (This special case of Theorem | was 
known to the ancient Greeks.) 


3. Ifnis an odd integer greater than 1, prove that there is a primitive Pythago- 
rean triple with x =n. If n is a prime, prove that only one such Pythagorean 
triple exists. 
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4. By a method similar to the proof of Theorem | obtain all “ primitive”’ 
solutions of 
ME Dy SF 


5. Find all “‘ primitive” solutions of 


x2 4 y? = 24 


11.2 / HISTORICAL NOTE: FERMAT’S LAST THEOREM 


Fermat made a practice of annotating his copy of the works of Diophantus 
with marginal notes. With one exception, the theorems mentioned in the 
margins have since been proved correct. 

One of Diophantus’ problems is closely related to Pythagorean triples (see 
Problem 1). In the margin next to this problem Fermat wrote the following 
comment: 


However, it is impossible to write a cube as a sum of two cubes, a fourth 
power as a sum of two fourth powers, and, in general, any power beyond the 
second as a sum of two similar powers. For this I have discovered a truly 
wonderful proof, but the margin is too small to contain it. 


In other words, the diophantine equation 
x" + y= 2" n= 3 


has no positive solutions. 

This problem, now called Fermat’s Last Theorem, has been attacked from 
every conceivable standpoint by the best mathematicians of the last 300 years. 
Many interesting results have been established, but the “theorem’”’ has never 
been proved. 

E. Kummer (1810-1893) made the greatest advances toward a solution. 
Instead of confining himself to the field of rational numbers, he extended his 
concept of number theory to include the algebraic numbers (those complex 
numbers that are roots of polynomials with rational coefficients). In 1843 he 
submitted what he thought was a proof, but Dirichlet pointed out a flaw in the 
argument. Kummer had assumed that factorization into “ primes”’ is unique in 
a certain subring of the algebraic numbers, when, in fact, this factorization is 
not unique. Because this assumption was essential, the proof was not valid. 

Kummer returned to the problem and, by using the theory of ideals, he was 
able to salvage parts of his proof and to establish very general conditions for 
the insolvability of Fermat’s equation. Most of the progress made on the 
problem in the last century has been along the lines of Kummer’s theory. 
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With the advent of the high-speed electronic computer, it has become pos- 
sible to check Kummer’s criteria for large exponents. At the time of this 
writing it has been proved that the diophantine equation 


x"+ y" = 2" 


has no positive solutions if 3 < n < 25,000. 

Fermat’s conjecture has received much attention from nonmathematicians 
in this century. This is partially due to the fact that in 1908 the German 
mathematician Wolfskehl left a bequest of 100,000 marks to the person who 
first solved it. The immediate consequence of the bequest was that the referees 
were swamped with “solutions” from amateurs, many of whom did not even 
understand the problem. Interest among amateurs has declined since the value 
of the prize was wiped out in the post-World War I inflation. 

We can be fairly safe in assuming that Fermat never had a valid proof. 
Since the problem has been attacked from every conceivable angle for 300 
years, it is unlikely that he had a method of proof that has not been redis- 
covered. 


EXERCISE 


1. Inkeri has proved that if Fermat’s equation does have a solution X9, yo, Zo 
for some positive integer 7 (with x9 < Yo < Zo), then 


2n? + 1\" 

(1) 0 (=| if XoYoZo # 0 (mod n) 
log 3n 

and 

(ii) Xo >n"—* — if xp¥oZ = 0 (mod n) 


Use the bound on 7 mentioned in the text to show that x) would have more 
than 320,000 digits. 


PROBLEM 


The following is the problem of Diophantus mentioned in the text: 

1. If ais a fixed positive integer, the equation x” + y” = a” has an infinity of 
rational solutions. [Hint: If x and y are rational, the number m defined by 
x = my — aisalso rational. Show that x and y can be expressed in terms of the 
parameter m.] 
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11.3* / SOME ELEMENTARY RESULTS ON FERMAT’S 
LAST THEOREM 


One of Fermat’s favorite methods of proof was the “ method of descent.”’ 
This is a variation on the well-ordering principle in which we show that if 
there exists a positive integer with a certain property, there must exist a 
smaller positive integer that has the same property. Thus, we obtain an infinite 
decreasing sequence of positive integers all having the specified property, 
which is clearly an impossibility. We will illustrate this method of proof by 
proving a special case of Fermat’s last theorem—a case which Fermat himself 
probably proved. 


= THEOREM 2/ The diophantine equation x* + y* =z? has no positive 
solutions. 


PROOF: Suppose the diophantine equation x* + y* =z? has a solution 
Xo, Yo. Zo. By an argument similar to that in Section 11.1, we may assume that 
Xo, Yoo Zo are pairwise relatively prime, Xp is odd, and yo is even. Since 
(X97) + (¥07)* = Zo” then X97, Vo”, Zo iS a primitive Pythagorean triple. Thus, 
there exist relatively prime positive integers s and t, s > t, one even and one 
odd, such that 

Xo? =5? — 2? 

Yo" = 2st (11.2) 

Zo = s? +t? 
From the first equation in (11.2) we see that xo, t, s is a primitive Pythagorean 
triple with x) odd and ¢ even. Using Theorem 1 again we find that relatively 
prime positive integers a and b exist, a > b, one even and one odd, such that 


Xp =a*?—b* t=2ab s=a’*+b? 
On substituting this into the second equation in (11.2), we obtain 


Yo” = 2(a? + b*) + 2ab 


y 2 
so that (2) = (a? + b*)ab 


Since a and b are relatively prime, then a” + b?, aand b are pairwise relatively 
prime. Applying Lemma 3 of Section 11.1 twice, we find that a, b and a? + b? 
are all squares, say 


a = Ay’ b = by? s=a*+b? =5, 
Thus, So? = a* + b*? =a,* + by* 


Since So ss<s*t+t? =2Z 
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we see that if a positive solution x9, yo, Zo exists, there must exist another 
positive solution ao, bo, 59 with so < Zo. It follows from the method of descent 
that no positive solutions can exist. 


= COROLLARY 2.1 / The diophantine equation x* + y* = z* has no posi- 
tive solution. 


We can now use Corollary 2.1 to show that Fermat’s equation has no 
solution in an infinity of cases: 


m= COROLLARY 2.2 / The diophantine equation x" + y" = z" has no positive 
solution if nm = 0 (mod 4). 


PROOF: If 2 = 4m and Xo, yo, Zo iS a positive solution of x” + y” = 2”, then 
Xo”, Yo", Zo” iS a positive solution of x* + y* = z+, which contradicts Corol- 
lary 2.1. 


By a simple modification of the proof of Corollary 2.2 we can show that it is 
sufficient to prove Fermat’s last theorem for the special case in which the 
exponent is an odd prime number. The proof is left for the reader. 


= THEOREM 3/ If there exists a positive integer n (n > 3) such that 
x" + y” = z" has a positive solution, there exists an odd prime number p (which 
divides n) such that x? + y? = z? has a positive solution. 


EXERCISE 


1. Prove Theorem 3. 


PROBLEMS 


1. If there exists a right triangle with rational sides such that the area is the 
square of an integer, show that there exists a primitive Pythagorean triple such 
that the area of the associated triangle is also a square of an integer. 


2. Use the method of descent to prove: There exists no primitive Pythagorean 
triple such that the area of the associated triangle is a square. Combine this 
result with that of Problem 1 to obtain a more general theorem. 
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11.4* / THREE DIOPHANTINE PROBLEMS 


In this section we will propose three diophantine problems (two of a 
classical nature). The solutions are left for the reader. 


A PROBLEM OF DIOPHANTUS 


The following problem was proposed by Diophantus (Problem 18, Book VI 
of Arithmetic): “‘ Find a Pythagorean triangle in which the bisector of one of 
the acute angles is rational.”’ 

Let us make some preliminary observations: (1) The triangle with sides ka, 
kb, kc has this property if and only if the triangle with sides a, b, c has it. Thus, 
we need only consider primitive Pythagorean triangles. (2) If a is the odd 
number in the primitive Pythagorean triple, a, b, c, then (using the standard 
notation) the bisector of angle A is irrational. 


Problem I. (a) Prove the above statements. (b) Prove that there exists 
an infinity of primitive Pythagorean triangles such that the bisector of the 
angle opposite the even side is rational. (c) Derive a parametric form for the 
primitive Pythagorean triples in (5). 


THE CONGRUUM 


A positive integer h was called a congruent number [not to be confused with 
‘congruence (mod m)’’] in the Middle Ages if there exists a rational number x 
such that x? + hand x* — hare both rational squares. Because the problem of 
determining all congruent numbers is very difficult, we will consider a related 
problem. 

We will call the positive integer k a congruum if the system of diophantine 
equations 


x+k=y? 
x? —k =2z? 


has a nontrivial solution. 


Problem 2. (a) If a, b, cis a Pythagorean triple, show that the number 
2ab is a congruum. (This result was known to the Arabs in the tenth century.) 
(b) Prove that each congruum is of the type found in (a); that is, if k isa 
congruum, there exists a Pythagorean triple a, b, c, such that k = 2ab. 

(c) If k is a congruum, prove that k = 0 (mod 24). Find the first few con- 
grua. 
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THE MAGIC HEXAGON 


A hexagonal array of the numbers | through k into k cells, such that all of 
the rows sum to the same number, is called a magic hexagon. The number of 
cells in a shortest row is called the order of the hexagon. A magic hexagon of 
order three is shown in Figure 11.1. 


Figure 11.1. Magic hexagon of order 3. (Copyright 1963, Scientific American.) 


In 1910 a railroad clerk, Clifford Adams, began searching for a magic 
hexagon of order 3. He found one only after a 47-year search and then, 
unfortunately, lost the paper on which the solution was written. After 5 years 
spent trying to rediscover the solution, he located the lost paper and sent it to 
Martin Gardner, editor of the “Mathematical Games”’ section of Scientific 
American. Gardner sent the solution to the mathematician C. W. Trigg, who 
was able to prove a rather astonishing fact: The hexagon found by Adams is 
the only magic hexagon of any order that exists (except for rotations and 
reflections of itself). It is, of course, the one shown in Figure 11.1. 

Trigg first reduced the problem to a diophantine equation involving the 
order which he proved had only the solutions 1 and 3. For order 3 he examined 
all logical possibilities and found no other magic hexagons. 


Problem 3. (a) Prove: If nis the order of a magic hexagon, the largest 
rows contain 2n — 1 cells each, the total number of cells is 3n? — 3n + 1, and 
the sum of the numbers in all of the cells is 

9n* — 18n? + 18n? —9n +2 
2 
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(b) Show that the row sum r satisfies the diophantine equation 


_ 9n* — 18n> + 18n? —9n +2 
7 2(2n — 1) 


Prove that this equation has only the solutions 2 = 1, r = 1 andn = 3,r = 38. 


CONTINUED FRACTIONS 
AND PELL’S EQUATION 


12.1 / INTRODUCTION 


In this chapter we consider the diophantine equation 
x? — Dy? =1(D¢0) 


Commonly called Pell’s equation after the seventeenth century mathematician 
John Pell, it probably should be called Fermat’s equation after the man who 
initiated its comparatively recent study. In one of his works, Euler mistakenly 
attributed the equation to Pell (who had no connection with it). 


Our plan of attack is based on the following considerations: If «/ D is an 


irrational number and a/b is a rational approximation to \/ D, then a?/b? is an 
approximation to D, say 


a’ 


C 
pe pees 


Multiplying by b*, we obtain 


a”? — Db* =c #0 


If a/b is a “‘ good’’ approximation to ap D (in some sense), we would expect c to 
be a small integer. If, in particular, c happens to be 1, we have a solution of 


Pell’s equation, and a/b is a “‘very good”’ approximation to yf D. Thus, the 
problem of solving Pell’s equation is closely associated with the problem of 


approximating a) D by rational numbers. 


As we will see later the best way to approximate ./ Dis by use of continued 
fractions. 


174 
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EXERCISES 


1. If we consider the decimal approximation of «/ 2 (/ 2 ~ 1.4142), we obtain 
the following sequence of rational approximations 


7 141 707 7071 
5’ 100’ 500’ 5000 


Which of these are “‘ good’”’ in the sense that a* — 2b7 = +1? 


2. Show that the following rational approximations to \/2 are all “good” in 
the sense of Exercise 1: 


12.2 / PRELIMINARY RESULTS ON PELL’S EQUATION 


Observe that if x, y is a solution of Pell’s equation, then so are x, — y; — x, 
y; and —x, — y. Because one of these pairs must contain nonnegative num- 
bers, we may restrict our investigation to nonnegative solutions. Observe also 
that if y = 0, then x = + 1; and if x = 0, we have no solutions except in the 
trivial case D = —1 when we have y = + 1. These trivial solutions will not be 
considered further. 

Two other cases can be ruled out at once: If D < 0, Pell’s equation obviously 
has no positive solution. If D is a square, say D = d?, the equation is 


x* — (dy)* =1 


If there is a positive solution, then 1 is the difference of two positive squares, 
which is clearly impossible. Thus, in this case we can have no positive solu- 
tions. 

In the remainder of this chapter we will consider Pell’s equation for the case 
where D is positive and not a square. We will obtain only the positive solutions 
of the equation. 

If Pell’s equation has a positive solution, it has a positive solution x,, y, for 
which x, + y, /D is a minimum. This solution is called the fundamental 
solution. The following theorem shows that this solution is well-named, for it 
can be used to generate all other positive solutions, 
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=» THEOREM 1 / If x,, y, is the fundamental solution of Pell’s equation, all 
positive solutions are of form x,, y, where 


Xn + ya D = (x, Ar yJ Dy" 
The solution x,, y, can also be computed from the formulas 
Xn = Hx + yr/D)" + (1 — yr/D)} 
1 os 
Va = 27D {(x, + y1/D)" — (x1 — yi,/DY3 
or from the recursive formulas 
Xn +17 X1Xp aa DY Yn 
Yat 1 = Xi Vn 7 XnJ1 


[When using the formula x, + Va D= (x, + yi DY", we find x, and y, 


by expanding (x, + y,J/ DY" and equating rational and irrational parts with 
x, and y,,.] 


PROOF: (Part I) Let x, and y, be defined as above. Observe that 
Xn yn\ D a (x4 a yi D)" 


[this follows easily from the expansions of (x, + yiJ Dy" and (x, — yiV Dy" 
by the binomial theorem]. Thus, 


Cie ni y,’D = (x, ~ yn D) (Xp a yn D) 
= (x4 — yr DY" (1 + yD)" 
=(x,7 —y,” D!/ =1"=1 


and so x,, y, iS a solution. 
(Part IT) Let x’, y’ be a positive solution of Pell’s equation. We must show 


that there is a positive integer n such that (x, + y, J D)'=x' + yf D. 
Suppose that no such z exists. Since x, + y,/ Dis the fundamental solution, 
there exists a positive integer m such that 


(x, + yy/ DY" < x’ +y'/D< (xy + yy D)"** (12.1) 
Since (x, - yr DY"(xy + yy Dy" =1 


then (x; - yi D)™ >0 
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Multiplying (12.1) by (x, — yi D)™ and combining terms, we obtain 
1 < (x! + yD), — yr DY" = (x + VD) Xm — Ym D) 
= (X'Xm — Dy’ Vm) + (Xm — x'y,,)/ D <X,+y/D (12.2) 
[t follows by direct computation that 
Xq = X'Xm — DY'Yms Vo = XmY' — X Vm 


is a solution of Pell’s equation. If we can show that it is a positive solution, we 
will contradict the fact that x,, y, is the fundamental solution. Since X9, yo isa 
solution, then 


Xo? — Yo? D = (Xo — Yo! D)(Xo + yoV D) =] 
From (12.2) we know that 
Xo + Yow D >0 
and so Xo- YoV D >0 


If we add these two inequalities, we obtain 


2X9 > 0 
Since Xo + Viens D>! 
then Xo — Yow D<! 


Subtracting these inequalities, we obtain 


2Vo/ D >0 


Thus, X9, Yo iS a positive solution, which is a contradiction. Therefore, each 
positive solution is of form x,, y,. 
The other formulas for x,, y,, are left for the reader (Exercises 3 and 4). 


m COROLLARY 1.1 / Ifx? — Dy? = 1 hasa positive solution, then it has an 
infinity of solutions. 


The recursive formulas for x,, y,, stated in Theorem | were known to the 
Hindu mathematician Bhascara (about A.D. 1140). 

In the following sections the theory of continued fractions will be developed 
as a means of solving Pell’s equation. 
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EXERCISES 


1. Show that the fundamental solution of x? — 15y? = 1 is x, =4, y, =1. 
Calculate x,, y, for n = 2, 3, 4. 


2. Prove that if Pell’s equation has a positive solution, then the fundamental 
solution exists. 


3. Prove: Xn = A{(x, + y1/Dy" + (x, - y1/D)"} 
1 _ = 
Vn = 2./D {(x, + y1/D)" (Xp = y1./D)"} 
4. Prove: Xn +1 = X1Xq_ + DYiyp 


Vn+1 = Xin + Xnay1 


5. Prove: If D is a positive integer which is not a square, then ./D is irra- 
tional. 


6. Verify that the pair xo, yo, defined in the proof of Theorem 1, is a solution 
of Pell’s equation. 


PROBLEM 


1. Suppose that x? — Dy” = —1 has a positive solution and x,, y, is the 


positive solution such that x, + viv D is a minimum. Define Xn» Vy, (n = 1, 
2 vee) by 


Xn + Ia D = (x, + YD" 


Prove that x,, y, is the fundamental solution of Pell’s equation, that all solu- 
tions of 


x? — Dy? =-1 
are given by x,, y, (” = 1, 3, 5, ---), and all solutions of 
x* — Dy? = 1 


are given by x,, y, (n = 2, 4, 6, 8, -:-). 
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12.3 / FINITE CONTINUED FRACTIONS 


a DEFINITION / An expression of form 


1 
ay t+ 


ay + 


where each a; (i = 2) is a nonzero complex number, is called a finite continued 
fraction. If all of the a’s are integers and all except possibly a, are positive, the 
continued fraction is called a finite simple continued fraction. 


Rather than use the cumbersome fraction notation, we will designate the 
above continued fraction by the symbol! 


<1, 2, ***s Aq) 
For example, 


(—1,1,2,3,45=—-1+4 


I 


(fe SS 


Obviously, any finite simple continued fraction represents a rational num- 
ber. We will now prove that the converse of this statement is also true. 


=» THEOREM 2/ Any rational number can be written as a finite simple 


continued fraction. 


1Other notations in common usage are [a1, @2, ---, @n] and 
1 1 1 1 
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PROOF: Let a/b bea rational number. We may assume that 5 > 0. From the 
Euclidean algorithm we obtain the following chain of equations: 


a=a,b+5, 0<b,<5 
b=a,b,+ 5b, 0<b, <5), 
b, =a3b, + b; 0<6b,<b, 


b, - 3 = 4, —~15,-2 +5, -1 0<b,-1<b,-2 


Dy — 2 = Andy - 1 


Because the b’s are positive integers, it follows that a,, a3, ---,a, are also 


positive integers. Rewriting these expressions, we obtain 


7 qa 
b -* b 
a p22 
b 07, 
by b3 
—_ = + — 
by > dy 
b, - 2 a 
Big 8 
By successive substitution 
—=q _ — 
b saan ae eo 
b, b, 
b, 
1 
Sra eee i = (a4, Az, °**, AyD 
az+° 1 
a, 


It is natural to inquire at this point whether the simple continued fraction 
expansion of a rational number is unique. It is easy to see that it is not, for if 
a, > 1, then 
(Ay, Az, ***, ay) _ (ay, Az, °°» A, — I, 1) 
while if a, = 1, then 

<a, Az, °*"s a,» =F <a, 2, °°°5 Ay — 4 + I> 
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It follows from this argument that a rational number can be expressed as a 
simple continued fraction in at least two different ways. It turns out that if we 
require the expansion to terminate with the number 1, the expansion is 
unique. The proof of this is left for the reader (Problem 1). We summarize 
these results in the following theorem. 


a THEOREM 3/A rational number can be written as a finite simple 
continued fraction in exactly two ways, one with an odd number of terms and 
one with an even number of terms. 


a DEFINITION / A continued fraction formed from <a,, a2, -::,a,> by 
neglecting all of the terms after a given term is called a convergent to the origin- 
al continued fractions. The first convergent is <a,>, the second convergent 1s 
<a;1, a>, +++, the nth convergent is ay, az, --:,a,>. We will denote the Ath 
convergent by the symbol C,. 


Therefore, 
a 
C,=<ay>= = 
1 a,a,+1 
Cz = (a4, a2) = ay a eer aa 
Z 2 (12.3) 
(a,a,+ 1l)a, +a, 
C = 9 5 = a eS TS 
3 = (ay, 42, a3) oa aE 1 Rae 


a3 


1 . 1 
Cdn gas es eee eae 
(42a, + 1)'a,+a, 


To simplify the above expression, we define two sequences inductively. (The 
sequences depend on the continued fraction (a,, a), --:, a,.) 


Po=! QO, = 90 

Py =a, Q,=1 

Py, =a,P, + Po QO, = 4,0; + Oo = a 
P3 =a3P, + Py QO, =a30,+Q; 
P,=a,P; + P2 0, =a,23; + Q2 


P, = a,P,-1 + Py 2 


Py = A,Py- 1 ae eee 


QO, = 4,Q; —-1 + OQ - 2 


On = Qn -1 + Qn - 2 
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If we substitute these values in (12.3), we obtain 


P, 
es a 
P 
Cz = (a4, a2) = a 
P, 
C = (€A;, 42, a3,> = — 
3 12 42, a3 0, 
P, 


Cy = (a1, 42, 23, Ag) = 


(Qs 


_ 


We will now prove that this relationship always holds. 


= THEOREM 4/ If C,is the kth convergent to <a, a>, ---, a,>, then 
Py 


C.,= 
Ob 


PROOF: We have shown the proposition to be true when k = | (and 2, 3, 4 
as well). Assume it is true for the integer k < n. The k + Ist convergent is 


Cy + ho CA, Az, *°*, Ay ~ 15 Aky Ay + > 


1 
= (a, Ar, °°*5 Ay — 15 Ay + 
An+1 


This expression for C, , , is the Ath convergent to a different continued frac- 
tion, call it C’,. We will denote the numerators and denominators of the con- 
vergents to this new continued fraction by P’; and Q’;, respectively. By our 
inductive assumption we see that 


1 
~P", _ Pc 
P’;, (a +=] k-i TE ko? 


C43 = Cy, = = 
o (a + )-On-1+O%-2 
QAn+1 
, ta 1 ¢ 
a,P),-14+ Py at a eae ees 
= An+14 
, ‘A 1 ¢ 
UQe-1 + Qn-24- ‘O77. -1 
k+4 


If we compare the convergents of the two continued fractions we see 


that P, =P’, and 0,=97; (j =0, l, 2: +k —1) 
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Thus 
(a,P,—-1 + P,-~2)+ Prey Pat Pye 
QAn+1 An+1 
Cha) =———___*4-—__- 
(a,Q,-1+Q,-2)+ Q,-1 G+ O.-1 
aAn+1 An+1 
— 44 Pet Pe-1 _ Pas 
Qn41On+Q.-1 Deva 
P 
Therefore, C,=— (k = 1,2, ---, n). 
k 
EXERCISES 


1. Calculate P;, Q;, C; (j = 1, 2, «--, 6) for the continued fraction <1, 2, 2, 2, 2, 
2». Note that the sequence C,, C, ---, Cg is the sequence of approximations to 
/2 given in Exercise 2 of Section 12.1. 


2. Express 179/127 as a continued fraction with 6 terms. 
3. Express 179/127 as a continued fraction with 7 terms. 


4.(a) Calculate P; and Q, for the continued fraction in Exercise 2. Show that 
x=Q,;, y = —P, is a solution of 179x + 127y = 1. 

(6) Calculate P, and Q, for the continued fraction in Exercise 3. Show that 
x = 06, y = —P,isasolution of 179x + 127y = —1. " 


PROBLEM 


1.(a) Prove that the representation of a rational number as a simple 
continued fraction is unique if we require the last term to be 1. [Hint: Use 
mathematical induction. } 

(b) Use (a) to prove that a rational number can be written as a simple 
continued fraction in only two ways. 


12.4 / PROPERTIES OF P, AND Q, 


The reader has probably noticed that in the exercises of the preceding section 
the following pairs of numbers are relatively prime: P, and Q,, P, and P,,+,,; 
QO,,and Q,, , ,-. In this section we will prove a basic theorem. As corollaries we 
will show that the above pairs of numbers are always relatively prime, and 
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that if (a, b) = 1, the convergents to a/b can be used to solve the linear dio- 
phantine equations 


ax+by=1 and ax + by = —1 


(see Exercise 4 of Section 12.3). The most important use of the theorem will be 
in establishing an order relationship among the convergents. 


a THEOREM 5 / Ifk = 1, then P,Q, _ ; as O,P, — 3 = (—1)*, 


PROOF: The proposition can be verified directly from the definitions of the 
sequences {P;} and {Q,} for the case kK = 1. Assume that the proposition is 
true for the integer K. Then 


Pe41Qx— Qx+1Pe = (Gx 41P K+ Pr -1s)Ox — (4x 410K + Ox -1)Px 
= — (PxQx-1 — OxPx-1) = —(-)* =(-)**? 


From the principle of mathematical induction it follows that the proposition 
is true for each positive integer k. 


aw COROLLARY 5.1 / If = 0, the following pairs of integers are relatively 
prime: P, and P,,,, Q, and QO, 4,,P, and Q,,. 


PROOF: The greatest common divisor of P,,and P,, , , isthe smallest positive 
integer that can be written as a sum of multiples of P,, and P,, , ,. From Theo- 
rem 5 we see that 


P, +1(—1)"**Q, + P,C— 1)" * 70, + 1 =] 
Thus, (P. 4 4s-P,) =1 


The other two proofs are similar. 


m» COROLLARY 5.2 / Let a and 5 be relatively prime integers, b > 0. If the 
simple continued fraction expansion of a/b has n terms, then 


x=0,-1,yV= “~ 4An-1 
is a solution of the linear diophantine equation 


ax + by =(-—1)" 


PROOF: C,, = P,/Q, = a/b. Since both a/b and P,,/Q, are reduced to lowest 
terms, then a = P, and 5 = Q,,.. By Theorem 5, 


aQ, -; + (—P, ~ 1) =(— 1)" 
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We have established that a/b can be written in two ways as a continued 
fraction: One with an odd number of terms and one with an even number of 
terms. Thus, Corollary 5.1 provides us with solutions to both 


ax + by =1 and ax + by = —1 


We will now use Theorem 5 to establish an order relationship among the 
convergents. This relationship is of basic importance in the theory of infinite 
continued fractions. 


=» THEOREM 6 / If C, is the kth convergent to the simple continued fraction 
{;, 4, °°*, a,», then 
C,<C3<C5<-- <Cpg<Cy<C, 
PROOF: Using Theorems 4 and 5 we obtain 
Cy 2 — Cy=(Crn 2 — Cras) + (Cas — Ce) 
_ (+2 - +4) (a - =) 
Q, oie Q, +1 Q,, eek Q,, 
(-1)'*? (-1**! 
— Qe+ 2Qe+1 Os 1% 


_(=DF* 1, +2 — Ov) 
Q,Q% + 1O% + 2 


Since Q, + 2 — Q; > 0 (see Exercise 1), the sign of C, . , — C, is the same as 
that of (—1)* *!; that is, C, , , > C, if k is odd, and C, , , < C, if k is even. 
Thus, 

Ci < C3 < C5 <-::: 
and 


C$ C3 CoS: 


We now must prove that if m is odd and 7 is even, C,, < C,,. Since C, > C, 
>C.>-- >C,, it is obviously no restriction to assume that n > m. Using 
Theorem 5 we obtain 


(— 1)" 
2,2, rik 


>0 


Co OC,24 = 


Because m and n — 1 are both odd, 


C=C, 4 =, 
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EXERCISES 


1. Prove that O,,,> Q,>0ifk = 1. 


2. Generalize Corollary 5.2 by removing the restriction that a and 5b are 
relatively prime. 


12.5 / INFINITE CONTINUED FRACTIONS 


Let a,, a2, a3, --- be an infinite sequence of real numbers. We would like to 
define the expression (a,, a>, ---> in a meaningful manner. Note that each of 
the finite continued fractions <(a,>, (a, 42>, °**, (41, A, °**, A,» °**, 1S defined, 
provided a; # 0 (i = 2, 3, ---). Thus, the natural way to define <a,, a,, ---) 
isas lim a,, a2, -:-, a,», provided the limit exists. 


noo 


We will now prove that in an important special case the above limit does 
exist. 


m= THEOREM 7/ Let a,, a,, --- be an infinite sequence of integers, all 
positive except possibly a,. Let C, = <a,, a2, ---, a,>. Then lim C, exists and is 
finite. oon 


PROOF: We may consider each C, as one of the convergents to a finite 
simple continued fraction. By Theorem 6, 


Ci, < C3 < C5 < °° < C6 < Cy < C, 


The sequence C,, is a bounded, decreasing sequence and thus has a limit «. 
Since each C,, is greater than each C,,, , then « is greater than or equal to 
each C,, ,,. Therefore, since lim Q, = 00 (see Exercise 4), 


k-— 0o 
(~1)* 
O< lim ~ Cons 1) < lim (Co ~ Cons 1) = lim 5 -— 
k > © ko O24Qo, 414 


= 0 


Thus, 
lim C3 4 = “= lim C5 


k-> © k— © 


and the sequence C,, converges to the real number a. 


= DEFINITION / Let a,, a,, --- be a sequence of integers, all positive except 
possibly a,. The expression 


<a, a2, ae) 
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is called an infinite simple continued fraction and is defined to be equal to the 
number 


lim (a1, @>,°**, A,» 


n-> 0 


Let « = <a, a>, --->. Since a is between C,, and C,, , , for any two consecu- 
tive convergents, we obtain the following theorem: 


=» THEOREM 8 / If C, is the nth convergent to the irrational number «, then 


la —-C,| < 


1 
2,2, +1 
Example I. 
Evaluate « = <1, 2, 2, 2, --->. How close is the approximation C.? 


SOLUTION: We write 


1 1 
*“IFTEG LR) ite 
Thus, q*=2 
so that a=V2 or a=-v2 
Since a>Ci>0 
then a=Vv2 


Since Ce = 99/70, Qc = 70, O7 = 169, then 


V/2 Zz <— < 0.00009 
~ 70! ~ 70169 ~ 


= NOTATION / It is customary to write continued fractions of the type 
considered in Example 1 with a bar to indicate the repeating block. Thus, 


<1, 2, Zs 2 — » = <1, 2» 
<1, Pe 3, 4, 1, Z, 3, 4, 1, Zz op 4, soe = cl, 2, 3, 4), 


and so on. 


- Example 2. 


Write V3 and — V3 as infinite simple continued fractions. 
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SOLUTION: 
@) V¥3=1+(V3—1)=1+4 
V3—1 
1 _V341 14+ V3_24+(V3-)_, i 
‘Bai 61° (f° ee 
V3—1 
Dae VB yy tye dy lp 


V3=1° #31 


V3—1 
From the last two equations, it follows that 
—_ 1 1,2 : | : 
Vest COV SST OAS ONT Ne 
= = (1, 2,1,2,1,2,--> = <1, 2 


Therefore, 
= 1 = 
V3=(1,—-=— ) =<1, 1,2 
( va—1/ * 4 


(b) eee, ae ey ee ee 


2— v3 
1 _24+V3 


a Oe wee ee ee ea 


V3 = 


In (a) we established that 


V3—1 
In the above examples we made the implicit assumption that 


1 


(Q4, Az, A3, °°) = ay + -—— 
{a2, 43, °**> 
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This important fact, which is trivial for finite continued fractions, must be 
proved for the infinite case (see Problem 1). 


= THEOREM 9/ Any irrational number can be written as an infinite con- 
tinued fraction. Furthermore, the representation is unique. 


PROOF: Let a be an irrational number. We write 


| 
a= [a] + {a} = [a] + — 
fo} 


where {a} represents the fractional part of «. Because a is irrational, then 
1/{«} is irrational and greater than 1. Let 


a, = [a] and == 
We now write 
1 
a, = [o,] + {o,$ = [oJ + aria 


{a;} 


where 1/{a,} is irrational and greater than 1. Let 


1 
a, = [4,], a2. = fas} 


We continue inductively: 


1 Basis 
a, = [a], a, = San} >I (a, irrational) 
2 
1 
a, = [a3],0, = fa} >1 (a, irrational) 
3 
1 
a= [62% },.4.— fo, >1 (a, irrational), 


On — i$ 


Since each «, is greater than 1, then a, > 1 (n = 2, 3, ---). If we substitute suc- 
cessively, we obtain 


© = (Ay, Hy) = (Ay, Az, M2) = (Ay, Ar, 23,3) = 


= Cay, Qo, ore ano a,» = eee 
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We will now show that « is equal to (a,, a5, --->. Note that C,,, the nth conver- 
gent to (a,, a2, -°--> is also the mth convergent to <a, a5, ---, G,, %,>. If we 
denote the (n+ 1)st convergent to this finite continued fraction by 
P’,+1/O'n+1 = %, then, by Theorem 5, 


Piet. cao 


Since Q, and Q’, , , become infinite as n > oo, then 


(-1y"*! 

lim (« — C,) = lim ———— = 0 

n> 00 n> «0 Q'n+1Qn 
and a= lim C, = (a,, a2, °--> 


The uniqueness of the representation is left for the reader (Problem 2). 


We have proved that each rational number can be written as a finite simple 
continued fraction, and each irrational number as an infinite simple continued 
fraction. It is conceivable that certain rational numbers can also be written as 
infinite simple continued fractions. The next theorem indicates that this is not 
the case. 


= THEOREM 10/ If « is an infinite simple continued fraction, then « is 
irrational. 


SKETCH OF THE PROOF: Show that if « =a/b, then |aQ, — bP,| is an 
infinite sequence of positive integers that converges to zero. 


We raise one additional question in this section: Let « be a positive irra- 
tional number. Is there a relationship between the convergents to « and the 
convergents to 1/«? It would be natural to conjecture that they are reciprocals. 
A more precise statement of the relationship is stated in the following theorem. 
The proof is left for the reader (Problem 3). 


m= THEOREM 11 / Let o« be an irrational number, greater than 1. The 
(kK + 1)st convergent to 1/a is the reciprocal of the Ath convergent to a (k = 1, 
2: 2685, 


EXERCISES 


1. Evaluate 
(a) <1, 2, 2; 
(b) <1, 2, 3, 4). 


2. Express ,/15 and 2 — ,/15 as continued fractions. 
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3. Calculate the first six convergents to the continued fraction expansion of 
=) 27 and compare them with the decimal approximation to af 27 given in a set 
of mathematical tables. 


4. Prove that lim Q, = oo. 


n-> 00 


PROBLEMS 


1. If « = (a,, a, --* >, prove that a, =[a] and 
1 
(42, @3, °**) 
2. Complete the proof of Theorem 9 by proving: If 
& = (dy, a2, -*- ) = <b, ba, «+ > 


a, = b, (n = 1, 2, 3, ---) 


a=a,t+ 


then 


[Hint: Use Problem 1.] 
3. Prove Theorem 11 by mathematical induction. 


The Fibonacci Numbers. The following problem refers to the famous sequence 
investigated by Leonardo Pisano (Fibonacci) in the thirteenth century. The 
Fibonacci numbers are is defined as follows: 


fi=l 
Sf, =2 


fs =hi tho =3 
0 ee ee ne ot 


4. Leta = <1 » and let C, = P,,/Q,, be the nth convergent to a. 


(a) Show that P, =/,. Can Q, also be expressed in terms of the Fibonacci 
numbers ? 
1 5 
(b) Prove: lim Jn = ee es 
n> OjJn—-1 
1 (/14+/5\Ft! “1 — /5\ett 
Prove: f, =—= — |——_— 
Mo rioea ral 2 ) ( 2 ) | 
(d) Prove: If <a,, a,, --- > is an infinite simple continued fraction with nth 
convergent C,, = P,/Q,, 


then 0,2Sn-1 
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12.6 / THE CLOSENESS OF APPROXIMATION 
BY CONVERGENTS 


We have proved that if P,/Q, is the kth convergent to the irrational number 
a, then 


P, 1 
gC- 


Q, 2,02, +1 


In this section we will prove several similar inequalities. 
Note that the above inequality implies the weaker relationships 


< 


Pp. 1 
*~ 9,1 0,? 
and |O,« — P,| < a 
Q, 


This last inequality implies that if « is an irrational number, there exists an 
infinity of positive integers b such that ba is arbitrarily close to an integer. 
We will now prove that if ba is “close’”’ to an integer, then b must be fairly 
large. More precisely, we will prove that of all numbers ba, where 1 < b < Q,, 
Q,, is the closest to an integer. 

=m THEOREM 12/ If aand 5d are integers, 1 < b < Q,, then 


[ba — al > |Q,« — P,| 


PROOF: Consider the system of equations 
PX P65) =o 
Q,X sg 0, +1 Y ame b 


Because the determinant of coefficients is (— 1)" (by Theorem 5), the system 
has a unique integral solution Xo, Yo. Since Q,,,2Q0,> 6, then X, and 
Y,> must have different signs and neither can be zero. Thus, 


[ba — al = |(Q,Xo + Qn + 1 Yo)% — (PrXo + Pu + 1 Yo)! 
= |Xo(Qne — Pr) + YoCQn + 1% — Pr+1)I 
= |Xo(Qn0 — Pa)l + | YoCQn + 1% — Pa+ I 
> |Xol 1Q.% — Pal = |Qn% — Pal 


We will use Theorem 12 to show that the convergents form a sequence of 
“best” approximations to the irrational number a. 
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= COROLLARY 12.1 /If1 <b < Q,, then 


a ssl Ps 
g—-- —a 
b Q, 
a n 
PROOF: If t=) a ——!, then 
n L 
lbx —a| <b: ja——|<Q,°: to = |Q,a — P,| 


which contradicts Theorem 12. 


Although theorems similar to Corollary 12.1 are useful, they have the draw- 
back that the denominator must be compared with the sequence Q,. We will 
prove a similar result in which the difference between « and a/b must only be 
compared with a function of b. The main result is that any “close”’ rational 
approximation to « must be one of the convergents to a. 


= THEOREM 13 / Let « be an irrational number. If a/b is a rational number 
with positive denominator such that 
Z 1 

2b? 


ee 
b 


then a/b is one of the convergents to «. 


PROOF: We will only prove the theorem for the case where a and b are 
relatively prime. The trivial extension to the general case is left for the reader 
(Exercise 1). 

Let a/b — a =€0/b* where e=+1 and 0<0@<14. Expand a/b as a 
simple continued fraction obtaining a/b = (a,, a, ---, a,>. By Theorem 3 we 
may choose 7 so that 


€=(-1)" 
Let P,/Q, be the kth convergent to a/b and define B by 
spac Bae Roce 
BQ, + Qn - 1 


Obviously, £ is real and irrational. Since a and 5b are relatively prime, then, 
using Corollary 5.1, we find that a = P, and b = Q,. Thus, 


60 «€8 a BP, + Pros (—1)" 


0, wb »b 0, BQ,+Qn-1 Q(BQ,+ Qn - 1) 
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If we solve this equation for B, recalling that « = (— 1)", we obtain 


Q, 


the inequality holding since 1/9 >2 and Q, = Q,-_,. Since 6B > 1 we can 
write B as an infinite simple continued fraction 


B = <b, bz, -*-> b,2>1 
Consider the simple continued fraction 
Y = (Ay, Aa, ++, Ay, By, Ba, +++) 
eae os ae 
OB+ One. 
then a/b = P,,/Q,, is the nth convergent to «. 


Since py = (a1, 42, °°°, a,, > = a 


We will now show that the inequality 


2 1 
2b? 


a 
a-- 


b 


is characteristic of the convergents to a. 


=» THEOREM 14/ Let « be an irrational number. Of any two consecutive 
convergents to «, at least one satisfies the inequality 


a—fl< : 
b| 2b? 
PROOF: Suppose 
P, 1 Poi & 1 
a—-—|> and SS 
Q, 20,” QO, +1 20, + i 
Since the convergents are alternately less and greater than «, then 
1 P, P,, cae ie P,, euaee 
SSS ee =|{—_ — @ + g—-—-—— 
0,2, TA Q, Q,, +1 Q,, Q,, +1 
P : | eran 1 n 1 
=|—— 4 a—-———| > —__——,, 
0, OQ, +1 20," 20, + ‘a 
= Q,, + i —- Q,° 
20,0, + 7 


From this inequality we see that 


(Q,41— Qn)? = Qn +17 — 20,0n+1+2,7 <0 
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which 1s impossible unless n = 1 and a, = 1. In this special case, 


0,=92,=1 and Q0,;=a,+122 
so that 


P P P 1 1 


Q, Q, Q; Q0; 2 


which is a contradiction. 


EXERCISES 


1. Find three integers b,, b,, b3 such that the distance from b,,/ 3 to the nearest 
integer is less than 1/20. 


2. Complete the proof of Theorem 13 by extending the proof to cover the case 
where a and b are not relatively prime. 


PROBLEM 


1. Let € be a positive number. At each lattice point in the plane draw a circle 
of radius, €, which has the lattice point as center. Prove that any straight line 
through the origin must intersect an infinity of these circles. 


12.7* / SIMULTANEOUS APPROXIMATIONS 


In the preceding section we obtain a sequence of rational approximations 
to a given irrational number a, satisfying 


2 1 
2b? 


a 
“Z-- 


; (12.4) 


Suppose we are given several irrational numbers, say «,, «2, ---,@,. Is it 
possible to obtain rational numbers a,/b, a,/b, ---, a,/b, having the same 
denominator, such that each satisfies an inequality of form 


< f(b) 


a: 
a;~ 
b 


where f(b) is a function similar to the right-hand side of (12.4)? It is obvious 
that this problem is closely related to the problem of finding an integer b such 
that each of the numbers 

ba,, ba,, «++, ba, 
is “‘close”’ to an integer. 
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In solving this problem we will not use continued fractions and, conse- 
quently, we must be satisfied with existence theorems. We will, however, be able 
to relax our restrictions and allow a,, &>, ---, «, to be arbitrary real numbers. 

As in Section 12.5 we let {X} denote the fractional part of the real number 
X; that is, {X} = X — LX]. We also define (X) to be the undirected distance 
from X to the nearest integer. For example, 


{134} =%4, (14) = (184) = 
u LEMMA / Let N be an integer greater than 1. If {x} and {y} are both 
between k/N (inclusive) and (k + 1)/N (exclusive), then (x — y) < 1/N. 
PROOF: Since (x — y) =({x} — {y)) =({)} — {x}), we need only consider 
the case where {x} > {y}. In this case 


k+1 k 


0<f}-}<— Fae 


Therefore, zero is the integer nearest to {x} — {y} and 


(— ) =} - OY = 63 - OF <5 


=» THEOREM 15 / Let a, «2, ---, «, be real numbers, let N be an integer 
greater than 1. There exist integers a,, a5, ---, a,, b, 1 < b < N”, such that 


1 
lba;—ajl<a (j= 1,2, --, n) 


PROOF: We define a “‘cube” in n-dimensional space by 
O<x,<1 (Vj = 1, 2, ---, n) 
and partition it into N” small cubes, defined by 


k k+1 
= Sx)< (kK =0,1,---,N—1;j =1,2,---,n) 


J 


Consider the following points in n-dimensional space: 
(0, 0, oe) 0), 


(ott {oats 7° Lond) 
(4201 $5 {242$, -°> {2nt), 


({N"a,,3, {Na}, «++, {Na }) 
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These N" + 1 points are all contained in the large cube. Since there are N” 
small cubes, at least one of them contains two or more of the points, say 


({ka}, {hoo}, ++, {kor,f) 
({a, 3, {raz}, +++, {ma,}) 


where m > k. If we now let b = m — Kk, it follows from the lemma that 


and 


1 
(ba ;) = (ma; — ka;) < N (j = 1,2, +++, n) 
that is, there exist integers a,, a2, ---, a, such that 


1 
lba;- al <= (j = 1, 2,--, n) 


In the special case where N equals the larger of 2 and [1/e] + 1, we obtain 
the following corollary: 
mw COROLLARY 15.1 / Let «,, «,, ---, «, be real numbers, let « > 0. There 
exists an integer b such that the distance from each of 
ba,, ba, °°, ba, 
to the nearest integer is less than e. 


Since 0 < 1/N < 1/b*/" we also obtain the following result which is similar in 
form to the statement of Theorem 14. 


m COROLLARY 15.2 / Let a,, «5, ---, «, be real numbers. There exists an 
infinity of sets of integers a,, a>, ---, a,, b, b > 0, such that 


1 
< b™ + 1)/n 


a; 
J 
Saeates a; 


b (j=1,2,---, n) 


12.8 / PERIODIC CONTINUED FRACTIONS 


m DEFINITION / An infinite simple continued fraction 1s said to be periodic 
if it is of form <a,, a2, °°, a, 51, bz, ---, b>. If it is of form <b,, 5, ---, b,,>, 
it is said to be purely periodic. The smallest positive integer m satisfying the 
above relationship is called the period of the expansion. 


In the examples considered so far in this chapter the infinite simple con- 
tinued fractions have all been periodic and have been roots of quadratic 
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equations with integral coefficients. In this section we will prove that this is 
always the case. 


mw DEFINITION / A real irrational number which is the root of a quadratic 
equation with integral coefficients is called a quadratic irrational. 


Observe that the above definition is equivalent to saying that a quadratic 
irrational is a real irrational root of a quadratic equation with rational 
coefficients. 


m LEMMA / Each purely periodic continued fraction is a quadratic irra- 
tional. 


PROOF: Let = Cay, QA, °°", a,» = a1, a>, ead an a». If P,/ QO, iS the kth 
convergent to «, then 


a a Ont Pa et 
aQO,+Q,-1 
so that 0,07 ++(O,-,—P,)«—P,-, =0 


Thus, since « is not rational (the continued fraction expansion is infinite), a is 
a quadratic irrational. 


In the proof of the next theorem, we use the following result from elemen- 
tary algebra: If « is a quadratic irrational and a, b, c, dare rational with not 
both c and d zero, then 

a+ ba 
c+ da 


is either rational on a quadratic irrational (Exercise 1). 


uw THEOREM 16/ Each periodic simple continued fraction is a quadratic 
irrational. 


PROOF: Let « = (ay, a, -++, ay, 51, bo, ---, b>, and let B = <b,, b5, «++, by». 
From the lemma we see that f is a quadratic irrational. If P,/Q, is the kth 
convergent to a, then, 


BP, a9 ie | 
BQ, ir QO, aaa | 
It follows from the remarks preceding the statement of the theorem that « is a 


quadratic irrational or a rational number. Since «& is an infinite simple con- 
tinued fraction, then a is a quadratic irrational. 


a= Ca, "°*5 Any B> ie 
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Before proving the converse of Theorem 16, we must establish some supple- 
mentary results. 


= THEOREM 17/ Let a be a real, irrational root of ax? + bx +c =0, 
where a, b, c are integers. If 


= CA, A, *°*, Any Uy 
then «,, is a root of the polynomial 
A,x? + Bx +C, =0 
where A, = aP,,” + bP,Q, + cQ,’ 
B, = 2aP,,P y - 1 + bPyQn - 1 + OPy - 1 Qn + 2CO,Qy - 1 
C, = aP, — 17 + bP, 1On—-1 + €O,— 1? 
and where B,”? — 4A4,C,, = b? — 4ac 


PROOF: If P,/Q, is the nth convergent to «, then 


bt Peed 
Q,en + On - 1 


If we substitute this value of « into ax” + ba + c = 0 and rearrange the coeffi- 
cients, we obtain 


A,o,” + B,%, + C, =0 


where A,, B,, C, are as given in the statement of the theorem. By direct 
computation we obtain 


B,” — 4A,C, = (b? — 4ac)(P,Qn- 1 — QnP, - 1)” = b* — 4ae 
m COROLLARY 17.1 / Let « be a quadratic irrational and write 
1 = €Ay, Ag, °°) Ans PY = (Nn = 1, 2, «-*) 
There exists a finite set of polynomials with integral coefficients 


A,x? + B,x+C, =0 
A,x? + Byx + C, =Q 


Ayx? + Byx + Cn = (0 


which have all of the «, as roots. 
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PROOF: Let «, be a root of A,x? + B,x + C, =0 where the coefficients are 
as given in Theorem 17. Since 

|2Q, — P,| < 

n n QO, 


we can write 


€ 
P,=0«0,+— lel <1 
Q, 
(where ¢, of course, depends on 7). Substituting this into the expression for A,, 
we obtain 
€ 


ne a(0, ms <) + b( 10, a 


Jon + 60," 


= (au + ba + c)Q,”? + (2a + b+ 5] € 


= (20 + b+ 5) -« 


Thus, |A,| < |2ax + b| + |a| 


which implies that all of the 4, must come from a finite set of integers. Since 
C, = A, — 1, we have a similar result for the C,. Since 


B,” — 4A,C, = b? — 4ac 
then 
B,? = |4A,C, + b* — 4ac| < 4{|2ax + b| + |a|}* + |b? — 4ac| 


and B, is also bounded. Because we have only a finite number of choices for 
Ans Bas Ca, We have only a finite number of distinct polynomials 


A,x* + B,x+C,=0 


which have all of the «, as roots. 


m THEOREM 18 / (Converse of Theorem 16.) Each quadratic irrational has 
a periodic expansion as a simple continued fraction. 


PROOF: Let « = (a), --:, @,, 4,» be a quadratic irrational. Since each a, is a 
root of one of the quadratic equations 


A,x? + Byx + Ce = Q 
A,x* + Bx +C, +0 


Ayx? + Byx + Cy =0 
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then one of these polynomials must have at least three of the «, as roots. Since 
a quadratic equation can have at most two distinct roots, then two of these 
a, must be equal, say 


Oy = Oe+¢ 
By the algorithm developed in the proof of Theorem 9, it follows that 


Qe41 ~ Art ti1 


Q+2 =~UA+1+2 


Ae+ jp ~U+ t+; 


Thus, 
“= <a, 2, °**5 Aes Ay 4 to °°'s Ak + » 


We have established that an irrational number has a periodic expression as a 
continued fraction if, and only if, it is a quadratic irrational. We will now 
investigate those quadratic irrationals that have purely periodic expansions. 


= DEFINITION / Let « =a + b,/D where a and BD are rational and JD is 
irrational. The conjugate of «, denoted by «’, is the number a — b ui D. 


u THEOREM 19 / Let a be a quadratic irrational. The simple continued 
fraction expansion of « is purely periodic if, and only if,a > land —1 <a’ <0. 


PROOF: (Part I) Suppose « = <a,, a>, ---, a,». Since each of the a’s is a 
positive integer, then « > 1. In the lemma preceding Theorem 16 we established 
that « is a root of 


F(x) = Q,x7 + (Qn - 1 — Py)X — Py -1 =0 
Since f(—1) = (Qn — On - 1) + (Pn — Pa - 1) > 9 
and fO) = —P,-, <9 


then «’, the other root of f(x) = 0, must be between — 1 and 0. 
(Part II) Suppose « is a quadratic irrational, « > 1, and —1 <a’ <0. Let 
>. = a. If we develop the continued fraction expansion of « as in the algorithm 
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in the proof of Theorem 9, we obtain (using the fact that the conjugate of the 
sum equals the sum of the conjugates, etc.) 


Ao = ay + — Og = Ay + — 
Oy 4 
0, =a, +— 0’; =a, + —- 
05) a2 
ne | ; 1 
Oy = Apa + Op=H— Asi ta 
O44 Ori 


We established in the proof of Theorem 9 that a, > 1 (kK = 1, 2, ---) and we 
know from the hypothesis that a) > 1. Thus, a, > 1 for all nonnegative k. It 
can be proved easily by mathematical induction that 


—~1<a',<0(k =0, 1,2, +) 


' 1 
Since 0, = Ape 1 ta 
Ak+1 
1 
then 0< -— —Ay41,<1 
Ok+1 


1 1 
Thus, [ - ; ]-a+1-[- ; = a4] =0 
Ak+1 Ok+14 


which implies that 
[ | 
On41 =] -— 7 
Ok+ 1 


Since « = (@;, 42, °°*, @,, %,» (n = 1, 2, ---) and & is a quadratic irrational, we 
know there exist positive integers k and ¢ such that 
Oe = Oe +t 


1 1 
Thus, a -|-=]-|- ]=2 
: OX Xk+et eee 
1 


and O14 =A, F— = Agee t+ 
OK Ok +t 


= Oe+t-1 
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If we apply this process k times, we obtain 
Oe =O +t 
Oe —-1 = 42-1 


Op —-2 ~%+24-2 


A= = 4, 


Therefore, a = (Aj, °°*, Ay, > = (Ay, °°", A> 


mw COROLLARY 19.1 / If the positive integer D is not a square, then 
JD i (ay, Az, ***» Ans 2a,> 
where a,= [L/ DY] 


PROOF: Let a =./D + [./D]. Since a > 1 and —1 <a’ <0, the continued 
fraction expansion of « is purely periodic and the first term in the expansion is 


[a] = 2[./D] 
Thus a = /D+ L/D) =<2L/DI, ag, +, dp 2/DD 
and JD =a-L/D) = L/D), a, +, ay 2/ DIY 


EXERCISES 


1. Let « be a quadratic irrational. Let a, b, c, d be rational numbers, not both c 
and d equal to zero. Prove that 

a+t ba 

c+ da 


is either a rational number or a quadratic irrational. [Hint: rationalize the 
denominator. ] 


2. Prove: (a + B)’ =a’ + PB’, (aB)’ = a’f’, and (a/B)’ = a'/P’. 


3. Expand as simple continued fractions: 


(a) /33; 
(b) ./47; 
(c) /94. 
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PROBLEMS 


1. Let m be a positive integer, D = m? + 1. Prove that 
/D = <m, 2m) 
2. Let m be an integer greater than 1, let D = m? — 1. Prove that 


JD =<m—1, 1, 2(m— 1)> 


12.9 / APPLICATION TO PELL’S EQUATION 


The first result in this section shows that any solution of Pell’s equation can 
be obtained from the convergents to si D. 


a THEOREM 20 / Let D be a positive integer that is not a square and let n 
be an integer that is less that /D in absolute value. If x9, yo is a positive 
solution of 


x7—y*D=n 


then X9/Vo is one of the convergents to «/ D. 


PROOF: Suppose n > 0. Since Xo, Yo is a positive solution of x? — y?D =n, 
then 


(Xo —Yor/ D\(Xo + Yor D) =n 
which implies that 


Xo > Yo./D 


Therefore, 


Yo Yo(Xo + Yo\/D) Yo(Vox/D + Yo./D) 


It follows from Theorem 13 that x9/yp> is a convergent to of D. 
If n < 0, then by a proof similar to the above, we find that yo/x> is a con- 
vergent to 1 //D. Using Theorem 11, we conclude that x9/y> is a convergent 


to JD. 
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=m COROLLARY 20.1 / If Xo, yo is a positive solution of x? — y?D = +1, 
then 


Xo = Pi» Yo = Q,, 


where P,/Q, is a convergent to ./ D. 


PROOF: From Theorem 20 we know that x9/yo = P,/Q,,- 


Since the fractions are reduced to lowest terms, then 
Xo = P n? Jy (: os 0, 


Before proving that Pell’s equation has a solution, we need one additional 
fact about the continued fraction expansion of me D. 


m LEMMA / Let D be a positive integer that is not a square. Write es D= 
{A1, Az, ***5 Aq» %,>» There exist positive integers b, and c, such that 


Pn t/D 
64 


n 
n 


and P? — DQ,” =(—1)"c, 


PROOF: Since /D is a root of x? — D = 0, then by Theorem 17, «, is a root 
of 


A,x* + Bx +C, =0 
where A, =P,” — DQ,” 
B, = 2P,P,-1 — 2DQ,Q, - 1 
CSP e4 DO, a4" 
and B,” — 4A,C, =4D 


Note that A, is positive if n is even, and negative if n is odd. Thus, we may 
write A, = (—1)"c, where c,, is a positive integer. Therefore, 


PS = DQ,” ae A, a (—1)"c, 
as was to be proved. From the quadratic formula we obtain 


_ —B,+2,/D_ —B,+2,/D 
ny an bye 
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We established in Corollary 19.1 that the continued fraction expansion of «@,, is 
purely periodic. Therefore, by Theorem 19, 


—B,+2,/D 


l<a,= 
on = (17"2¢, 
—B, ¥ —B,F2/D © 
d —l<¢’.= 
an <a’, (—1)"2e, 
Adding, we obtain 
0<a,+ Fn 
a’, 
ieee 1G) 


so that B, is positive ifm is odd, and negative if n is even. From the formula for 
B,, we see that B, is even. Thus, we can write 


= (—1)"* 126, 
where J, is a positive integer. If we subtract the above inequalities, we obtain 
+2,/D 
(—1)"c, 
so that the sign of +./ D is the same as that of (— 1)". Therefore, 


l<a,-—a,= 


—~B,+(—1)"2,/D (—1)"* 22b,+(-1)"2,/D__ b, + ./D 


— SS | AES 
—- 


a THEOREM 21 / Let D bea positive integer that is not a square. Let m be 
the period of the expansion of ./ D as a simple continued fraction. 
(a) If mis even, the positive solutions of x? — y?D = 1 are 


X = Pans VY = Quem (kK =1, 2, 3, ---) 


The diophantine equation x? — y?.D = — 1 has no solution. 
(b) If mis odd, the positive solutions of x? — y?D = 1 are 


X = Pin, Y = Qh (kK = 2, 4, 6, ---) 
The positive solutions of x? — yD = — lare 
X = Prams VY = Quem (kK = 1, 3, 5, ++) 
PROOF: By Corollary 20.1 we know that any positive solution x9, yo of 


x? — Dy? = +1 is of form xo = P,, Yo = Q, where P,/Q, isa convergent to 
./ D. By the lemma we know that 


Pe > DQ,’ =(—1)"c, 
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where c, is a positive integer such that «, = (b, + wh D)/c,. Since the expansion 
of a = [./D] + VJ Dis purely periodic with period m, we can write 


o= «2a, a2, °°", Aim) = <2a,, QA, °***s Am at» 


y, — b/P1+ VD 
si 1 


so that 


Thus, ¢;,, = 1, which implies that 


X = Pim Y = Qe 


is a solution of x? — y*D =(—1)*". If m is even, then P,,,, Qym iS a solution of 
Pell’s equation for each positive integer k. If m is odd, then P,,,, Qjm is a solu- 
tion of Pell’s equation if k is even, and is a solution of 


x? -y*D=-1 


if kK is odd. 
To prove that these are the only solutions of x? — y*D = + 1, we need only 
show that if c, = 1, then n = 0 (mod m). Note that if c, = 1, then 


a, =b, + /- D 
Because the expansion of a, is purely periodic, then by Theorem 19, 
—l<a’,=b,-JD<0 

and so J/D-1<b,<,/D 
Because , is an integer, this implies that b, =[./ D] and so 

4, = [./ D] +,/ D 
which can only be the case when n = 0 (mod m). 
m COROLLARY 21.1 / Let D be a positive integer that is not a square. Let 
m be the period of the expansion of ./ D as a simple continued fraction. The 


fundamental solution of Pell’s equation is x = P,,, y = Q,, if m is even, or 
xX = Pia J = O>m if m iS odd. 


EXERCISES 


1. Calculate the fundamental solution of 
x? — 94y? = 1 


Does x? — 94y? =—1 have a solution? 
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2. For which integers, n, 0 < |n| < af 94, does 


x? — 94y* =n 
have a solution? 


12.10* / THE DIOPHANTINE EQUATION x2 — y2D=n 


The diophantine equation 
x*—y*D=n (12.5) 


was considered briefly in Theorem 20. We established that if 0 < |n| < ) D, 
any positive solution of this equation satisfies x/y = P,/Q, for some positive 
integer k. Let m be the period of the expansion of ./ D. If (12.5) has a solution 
for 0 < |n| <./D, it follows from the lemma preceding Theorem 21 that an 
infinity of solutions exist, at least one obtained from the first m convergents or 
from the first 2m convergents, accordingly as m is even or odd. Consequently, 
in this section we will be primarily interested in solving Equation (12.5) when 
In| > JD. 


a DEFINITION / A solution xo, yo of x? — y?D = nis said to be primitive if 
Xo and yo are relatively prime. 


It is obviously sufficient to obtain only primitive solutions, because if 
(Xo, Yo) = d > 1, then x,/d, yo/d is a primitive solution of 


n 


SEP Das, 


ws THEOREM 22/ If x9, yo is a primitive solution of x? — y?D =n, there 
exists an integer & such that 


Xo = kyo (mod n) 
and k? = D(mod n) 


PROOF: Since Xo and yo are relatively prime, then so are y, and n. Thus, 
there exists an integer k such that 


Xo = kyo (mod n) (Theorem 4 of Chapter 3) 
Therefore, 
Xo? — Dyo? = k* yo? — Dy? = yo2(k? — D) = 0 (mod n) 
which, because (y”, 2) = 1, implies that 
k? = D(mod n) 
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mw COROLLARY 22.1 / If Disa quadratic nonresidue (mod n), the diophan- 
tine equation 


x* — Dy* =n 
has no primitive solutions. 
m DEFINITION / We say that the solution x9, yy of Equation (12.5) belongs 


to the integer k if 
Xo = kyo (mod n) 


and k? = D(mod n) 
It is easy to see that if (12.5) has a positive solution, then it has an infinity of 


such solutions. Suppose that x9, yo is a solution and that s, ¢ is a solution of 
the corresponding Pell’s equation. If we define x,, y, by 


x4 + yD = (xo + yo D\(s + tx/D) 


it follows that x,, y, is also a solution of (12.5). Because Pell’s equation has an 
infinity of positive solutions s, t, this implies that (12.5) has an infinity of cor- 
responding positive solutions. 


a DEFINITION / Two solutions x,, y, and x,, y, of (12.5) are said to be in 
the same class if there is a solution Xo, Yo of (12.5) and solutions s,, ¢, and s5, 
t, of Pell’s equation such that 


x, + yD = (Xo + Yow DS: + tv D) 
and X2 + VoD = (xo + Yow D\(s2 + trV/ D) 
We will now prove a fundamental theorem concerning the classes of solu- 


tions of Equation (12.5). 


=» THEOREM 23 / Two primitive solutions of x? — y?D = nare in the same 
class if, and only if, they belong to the same integer. 


PROOF: (Part I) Suppose x,, y, is a primitive solution of (12.5) and that 
x1 + Yiv/D = (Xo + Yow Ds + t/D) 
where 5s, fis a Solution of Pell’s equation and Xo, yp is a solution of (12.5). Then 
X, = X05 + YotD, Vy =Xot + Vos 


which implies that x9, yo is also a primitive solution. Thus, xo, yo belongs to 
an integer k. Substituting, we obtain 


yk = Xotk + yosk = yoktk + Xos = YotD + xXos = xX, (mod n) 
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Thus, x,, y,; and Xo, Yo belong to the same integer k. If x2, y2 is in the same 
class with x,, y,, it follows similarly that x,, y, also belongs to k. 
(Part II) Suppose x,, y, and x,, y, belong to the integer k. Then 


X2 + Y2/D - (x2 + Yor/ Dx oa Y1y/D) 
Xy+ y1/D (x, + y1,/D\(x, = y1,/D) 


X4X5o— D — 
ai Yi2 4 X1Y2 ot DS 


n n 
Since 
x, = y,k (mod n), x, = yk (mod n), and k? = D (mod n) 
then X1X_ = Vi V2k? = y,y2D (mod n) 
and X1V2 =Vi 2k = X2y, (mod n) 


Thus, the two fractions in the right-hand side of (12.6) are integers, say 


X4X2 — Wi 2D = X1V2— X2y, on 
n : n 


ee 


Xi + y1/D 


Similarly, x2 Yay _ — t./D 


—y a a 
Therefore, 


on Nn Dx X2 — Ya/D 
s? — 17D =(s + t./D\(s — t,/D) = a x2 + Yay D = 
v v Xy + x4 +Y1/D Xi x1 — ¥1/D 


and so 


which means that s, ¢ is a solution of Pell’s equation. Since 


X2 + Yo D =(x, + iV DMs + ty/D) 


then x,, y, and x,, y, are in the same class. 


=m COROLLARY 23.1 / The number of classes of primitive solutions of 
x? — y*D = nis finite. 


We will now develop a method to solve (12.5) when |n| > ae D. This method, 
a variation on one due to Lagrange, reduces the problem to that of solving a 
related diophantine equation 


x?—y*D=m 


where [m| < |n| 
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= THEOREM 24 / Let k be an integer such that k? = D (mod n). Let 


_k?—D 
7 n 


m 


The diophantine equation x? — y?D =n has a positive solution belonging 
to the integer k if, and only if, 


t?7-uwD=m 


has a solution fg, Up (not necessarily positive) belonging to k such that 


ku, —t 
~_O *O > @) 
m 
D — kt 
and Oe >0 
m 
If this is the case, then 
UgD a kto kuo me to 
Xo = ——> eS 
m m 


is a positive solution of x? — y?D =n, belonging to k. 
PROOF: If Xo, Yo 1S a positive solution of (12.5) which belongs to k, we can 
write 
Xo = Yok — ugn, D = k* — mn 


where up is an integer. If we substitute these values of x, and D into (12.5) and 
simplify, we obtain 


orm — 2yVokuo + (uo?n = 1) = 0 
which, by the quadratic formula, implies that 


kg £ \/k7uy? — muy’n +m __ kuy + ./Duy? +m 
m 


Yo a 


Because yy is an integer, the value of the radical ./ Duy? +m must be an 
integer, fp, implying that fo, up is a solution of the diophantine equation 
t7—~y?D=m 
For the same reason we must have 
kup + to = 0 (mod m) 
By varying the sign of tg we may assume that 


kup = to (mod m) 
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so that tp, uo belongs to the integer k. Thus, 


kug — to 
= 
m 
and 
k*uy —ktp —upmn uD — kt 
w=)“ i1nS ee 
m m 
which implies that 
kuy —t UD — kt 
sec A and uid alee rad 
m m 
are positive. 


To prove the converse, suppose that fp, uy is a solution of t? — u?7D =m 
which belongs to k, and that 


ku, —t D—kt 
POG. ser 
m m 
Define x, and yo by 
UD — kto kuo a to 
X9 = —————> = ————— 
0 a Yo - 


By reversing the above steps, we find that 
Xo = Yok — uon 
so that x, is an integer congruent to yok (mod n), and 


ae — Yo" D =n 


In the above proof the inequalities 


ku, — t 
eked 
m 
D—kt 
and i acl 
m 


are only needed to insure that x9, yo is positive. If we drop the requirement 
that xo, Vo is to be positive, we obtain |xo|, |yo| as a positive solution which 
may belong to —k rather than k. By a simple modification in the above proof 
we obtain the following theorem: 


= THEOREM 25/ Let k and m be as in Theorem 24. The diophantine 
equation x? — y*D =nhas a positive solution belonging to +k if, and only if, 
the equation 

??7—u*D=m 
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has a positive solution fo, uy belonging to +k. If this is the case, 


UD + ktp 
m 


to + kuo 
m 


9 — 


Xo = 


is a positive solution of x? — y?D =n which belongs to +k, the sign being 
chosen so that ty + kuy = 0 (mod m). 


If |n| > / D, the number m (in Theorems 24 and 25) is either less than ./ Din 


absolute value or is less than |n|/4. To prove this, we note that k can obviously 
be picked in the range 0 < |k| < n/2. If k* — D>, then 


k?—D| __ k? 2|* 
ml =| Jc lw 


In| |n| 4 

Ifk? — D <0, then 
k? —D D D ae 
vs n | Pa tale 


By applying the method repeatedly, we eventually obtain an equation 
t’?—y'*D=m' 


where |m’| </D. A word of caution: In applying the method repeatedly we 
must pick different values of k at each step. 


Example |. 
Show that x?+ 13y? = 51 has a solution. 
SOLUTION: We find that k = +8, s = 1. The auxiliary equation to be solved is 
t?— 13u? =1 
The fundamental solution of this equation is 
t = Pio = 649, u = Qio = 180 


Because this solution belongs to 8 (and to —8 as well), we obtain 


180-13 —8- 649 

x= —— = 2852 
649 — 8- 180 

Yo= a ai = 791 


Note that this solution belongs to —8, rather than to 8. 
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Example 2. 


Show that x? — 82y? = 23 has no solution. 
SOLUTION: We find that k = +6 are the only values of k in the range 
23 
O<|k[< > 


such that k? = 82 (mod 23). Thus 


Since V'82 = <9, 18), the only equations of form 
x?—82y?=n (0<|n| < V82) 


that have solutions are 
x? — 82y?=+1 


Thus, x? — 82y? =m= —2 


has no solution, and, by Theorem 24, x? — 82y” = 23 has no solution. 


EXERCISES 


1. Determine the classes of solutions for the following equations: 
(a) x* — 23y? = 43; 
(b) x* — 23y? = 41. 
2. Prove that solutions x,, y, and x3, y, of x? — y*D =nare in the same class 
if, and only if, 
X4X2 — Wy y2D = y,X2 — X12 = 0 (mod D) 


12.11* / LATTICE POINTS ON CONICS 


The theorems in Sections 12.9 and 12.10 can be interpreted as statements 
about lattice points on certain hyperbolas in the X Y-plane. The problem of 
determining the lattice points on a given conic will now be considered. We 
obviously may restrict our investigation to conics that have defining equations 
with integral coefficients. 


THE ELLIPSE 


An ellipse (or circle) can have at most a finite number of lattice points which 
can be found by trial. 
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THE PARABOLA 
The defining equation for a parabola can be written in the form 
(ax + by)? +cex+dy+e=0 (12.7) 


where the coefficients are integers and D = ad — bc # 0 (see Problem 1). If we 
let z = ax + by, we obtain the system of equations 


ax+ by =z 
cx + dy = —z?—e 
that has the unique solution 


bz* + dz + be az* +cz+ae 
x -eOoO——_——_—_——_————-———- ———— 


D ae D 


Consequently, x, y is an integral solution of (12.7) if, and only if, there is an 


integer Zz, which satisfies the system of congruences 
az? + cz + ae = 0(mod D) 
(12.8) 
bz? + dz + be = 0 (mod D) 


Obviously, any solution Zp of (12.8) will determine a class of solutions of 
(12.7); that is, those solutions corresponding to values of z which are congru- 
ent to Z) (mod D). If Xo, yo is the solution of (12.7) corresponding to Zo, the 
general solution in this class can be written in parametric form as 


ee b(Zp + kD)? ue + kD) + be 


— Xo + (2bZo + d)k + bDk? 
y = g(k) = yo — (2aZ_ + c)k — aDk? (k =0, +1, +2, ---) 


Example |. 


The equation 
(2x + 3y)?+x—3y+7=0 
leads to the system of congruences 


2z7+2z=4 (mod 9) 
3z? — 3z = 6 (mod 9) 


which has no solution. Consequently, no lattice points lie on the graph of the 
given equation. 
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Example 2. 


Find all lattice points on the graph of 
(7x —y)? —x+y+6=0 


SOLUTION: Following the steps described above we find that z must satisfy 
7z? —z+42 = 0 (mod 6) 
—z?+ z—6=0 (mod 6) 


This system has four incongruent solutions (mod 6): Zo = 0, 1, 3, 4. The four corre- 
sponding classes of solutions are 


x= —1+k — 6k? x = —1—k — 6k? 
y= —7+k — 42k? y= —8+4+ 13k — 42k? 
x= —2 — 5k — 6k? x= —3 — 7k — 6k? 
y= —17 — 41k — 42k? y= —25 — 55k — 42k? 


THE HYPERBOLA 


The problem of determining all lattice points on a hyperbola is much more 
difficult. Consequently, we will restrict ourselves to the special case where the 
equation of the hyperbola can be written in the form 


Ax? + Bxy+Cy?=D A#0 


It is obviously no restriction to assume that Bis even. Thus, we may rewrite 
the above equation as 


ax? +2bxy+cy*=d a0 (12.9) 
If we multiply this equation by a and make the substitution 
s — bt 
x= yea 
a 
it is transformed into the equation 
s* — (b? — ac)t? =ad (12.10) 


It is easy to verify that any solution So, to of (12.10) that satisfies 
So = bty (mod a) 
will generate a solution 
So Bee, bto 


Xo = ——— ; Yo = lo 
a 3 


of (12.9). Thus, the problem of solving (12.9) reduces to that of solving (12.10), 
subject to the side condition s = bt (mod a). 
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Example 3. 


Find all solutions of 


2x? — 14xy + 13y? = 1 


SOLUTION: The substitution 


transforms the given equation into 
s? — 2317 =2 (12.11) 
which must be solved subject to the side condition 
s+ 7t=s+t=0 (mod 2) 
The fundamental solution of Pell’s equation 
u* — 230? = 1 


iS u, = 24, v; = 5. Equation (12.11) has only one class of solutions, in which the 
smallest positive solution is s = 5, t = 1. Thus, all solutions of (12.11) are of form 
+5,, -tn, where 


Sn + ty V23 = (5 + V23)(24 + 5 V23)" 
or Sn + th V23 = (—5 + V23)(24 + 5 V23)" 
and these solutions generate all solution of 

2x? — 14xy + 13y? = 1 


For example, when 1 = 1 we obtain s = 235, t = 49 from the first equation and 
s = 5, t = —1 from the second. The corresponding values of x and y are 


x = +289, y= +49 x=+l,y=+1 


EXERCISES 


Find all lattice points on the following conics. 
1. (3x + Ty)? +5x—y—12=0 
2. (8x + 5y)? —x+2y—9=0 
3. 3x? + 12xy + Ty? = —2 
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PROBLEM 


1. Let f(x) be an equation with integral coefficients which has a parabola as its 
graph. Show that the equation can be rewritten in the form 


(ax + by)* +cex+dy+e=0 


where ad — bc £0. [Hint: If f(x) = Ax? + Bxy + Cy? + Dx + Ey + F=0, 
then B* — 4A4C = 0.] 


12.12* / REMARKS ON HIGHER ORDER DIOPHANTINE 
EQUATIONS 


Let f(x, y) be an integral polynomial in two variables of degree n. The 
problem of solving the diophantine equation f(x, y) = 0 has been considered 
for the special cases n = 1 and n = 2 (Chapters 2 and 12). 

If n = 1, an infinity of solutions exist (if any exist at all) and they are distri- 
buted in a regular pattern. If n = 2, it is possible that f(x, y) = 0 may have an 
infinity of solutions, but they are no longer distributed in a simple manner. In 
most cases an equation of degree 3 or higher will have at most a finite number 
of solutions. 

In 1909, A. Thue proved that a diophantine equation of form 


A,X" + Ay — xX” ty tee bayxy"  ' +agy"=d (n>3) (12.12) 


has at most a finite number of solutions unless the left-hand side is a power of a 
linear or quadratic polynomial with integral coefficients. If this is the case, 
Equation (12.12) can be written as 


(ax + by)" =d 
or (ax? + bxy + cy?)"=d 


If a solution exists, then d must be the mth power of an integer d’ and any 
solution of 
ax+by=d' 


or ax* + bxy + cy? =d’ 
is a solution of (12.12). 


In 1921, C. L. Siegel proved that if f(x, y) is an irreducible polynomial’ of 
degree greater than 2, then the diophantine equation f(x, y) = 0 can have at 


2°“ Trreducible’’ means that f(x, y) can not be factored as a product of two 
polynomials of lower degree, each with integral coefficients. 
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most a finite number of solutions unless the following special condition holds: 
There exist numbers a, a, — 1, «°°, Ap, @-1, «°°, @_, and B,, b, _ 1, -*+, Bo, b_4, 
---, b_,, such that if we make the substitution 


: a_ a: 
X= (2) = dyZ" +o + az + ag FE HS 
; b_, b_, 
y = h(z) = 5,2 aaa 2 Ce | ae ge eae ae 


the expression F(z) = f(g(z), h(z)) is identically zero. 
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Table! / Prime Numbers less than 5000 


2 179 419 661 947 1229 1523 1823 2131 
3 181 421 673 953 1231 1531 1831 2137 
5 191 431 677 967 1237 1543 1847 2141 
7 193 433 683 971 1249 1549 1861 2143 
1] 197 439 691 977 1259 1553 1867 2153 
13 199 443 701 983 1277 1559 1871 2161 
17 211 449 709 991 1279 1567 1873 2179 
19 223 457 719 997 1283 1571 1877 2203 
23 227 461 727 1009 1289 1579 1879 2207 
29 229 463 733 1013 1291 1583 1889 2213 
31 233 467 739 1019 1297 1597 1901 2221 
37 239 479 743 102] 1301 1601 1907 2237 
4] 241 487 751 1031 1303 1607 1913 2239 
43 251 491 757 1033 1307 1609 1931 2243 
47 251 499 761 1039 1319 1613 1933 2251 
53 263 503 769 1049 1321 1619 1949 2267 
59 269 509 773 1051 1327 1621 1951 2269 
61 271 521 787 1061 1361 1627 1973 2273 
67 277 523 7197 1063 1367 1637 1979 2281 
71 281 541 809 1069 1373 1657 1987 2287 
73 283 547 81] 1087 1381 1663 1993 2293 
79 293 557 821 1091 1399 1667 1997 2297 
83 307 563 823 1093 1409 1669 1999 2309 
89 311 569 827 1097 1423 1693 2003 2311 
97 313 571 829 1103 1427 1697 2011 2333 
101 317 S77 839 1109 1429 1699 2017 2339 
103 331] 587 853 1117 1433 1709 2027 2341 
107 337 593 857 1123 1439 1721 2029 2347 
109 347 599 859 1129 1447 1723 2039 2351 
113 349 601 863 1151 1451 1733 2053 2357 
127 353 607 877 1153 1453 1741 2063 2371 
131 359 613 881 1163 1459 1747 2069 2377 
137 367 617 883 1171 1471 1753 2081 2381 
139 373 619 887 118] 1481 1759 2083 2383 
149 379 631 907 1187 1483 1777 2087 2389 
151 383 641 911 1193 1487 1783 2089 2393 
157 389 643 919 1201 1489 1787 2099 2399 
163 397 647 929 1213 1493 1789 2111 2411 
167 401 653 937 1217 1499 1801 2113 2417 
173 409 659 941 1223 1511 1811 2129 2423 
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Table | (continued) 


2437 2749 3083 3433 3733 4073 4421 4759 
2441 2753 3089 3449 3739 4079 4423 4783 
2447 2767 3109 3457 3761 4091 4441 4787 
2459 2777 3119 3461 3767 4093 4447 4789 
2467 2789 3121 3463 3769 4099 4451 4793 
2473 2791 3137 3467 3779 4111 4457 4799 
2477 2797 3163 3469 3793 4127 4463 4801 
2503 2801 3167 3491 3797 4129 4481 4813 
2521 2803 3169 3499 3803 4133 4483 4817 
2531 2819 3181 3511 3821 4139 4493 4831 
2539 2833 3187 3517 3823 4153 4507 4861 
2543 2837 3191 3527 3833 4157 4513 4871 
2549 2843 3203 3529 3847 4159 4517 4877 
2551 2851 3209 3533 3851 4177 4519 4889 
2557 2857 3217 3539 3853 4201 4523 4903 
2579 2861 3221 3541 3863 4211 4547 4909 
2591 2879 3229 3547 3877 4217 4549 4919 
2593 2887 3251 3557 3881 4219 4561 4931 
2609 2897 3253 3559 3889 4229 4567 4933 
2617 2903 3257 3571 3907 4231 4583 4937 
2621 2909 3259 3581 3911 4241 4591 4943 
2633 2917 3271 3583 3917 4243 4597 4951 
2647 2927 3299 3593 3919 4253 4603 4957 
2657 2939 3301 3607 3923 4259 4621 4967 
2659 2953 3307 3613 3929 4261 4637 4969 
2663 2957 3313 3617 3931 4271 4639 4973 
2671 2963 3319 3623 3943 4273 4643 4987 
2677 2969 3323 3631 3947 4283 4649 4993 
2683 2971 3329 3637 3967 4289 4651 4999 
2687 2999 3331 3643 3989 4297 4657 

2689 3001 3343 3659 4001 4327 4663 

2693 3011 3347 3671 4003 4337 4673 

2699 3019 3359 3673 4007 4339 4679 

2707 3023 3361 3677 4013 4349 4691 

2711 3037 3371 3691 4019 4357 4703 

2713 3041 3373 3697 4021 4363 4721 

2719 3049 3389 3701 4027 4373 4723 

2129 3061 3391 3709 4049 4391 4729 

2731 3067 3407 3719 4051 4397 4733 

2741 3079 3413 3727 4057 4409 4751 
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23 


29 


31 


37 


Table Il / Table of Indices for Primes <100 


Modulo 17, the index of 8 is 10. The number which has index 8 is 16 


n 
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anti-ind n 
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2 
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10 
13 


27 
26 


8 


10 


16 


9 


10 


25 
10 


11 


13 
11 


12 


26 
12 
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21 
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56 
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57 
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43 
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31 
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45 
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Table Il / Continued Fraction Expansion of .{D 
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Table I11 (continued) 
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Table 111 (continued) 


4, 16 


8, 


68 


2, 3, 3, 2, 18 


9, 


* 89 


18 


90 
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1.4. 
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SUGGESTIONS AND ANSWERS 
FOR SELECTED EXERCISES 


Chapter 1 


Exercise 3. Let N(m) be the minimum number of moves needed to shift 
the first m disks to one of the other posts. Show that 


N(m + 1) = 2N(m) + 1. 
Use this result to prove that N(m) = 2” — 1. 


Problem 1. If T is the set of all numbers of form n—=s, then T must 
have a smallest element, ¢. Show that n — t is the largest element of the 
original set. 


Exercise 2(a). q = —4, r = 3. 


Exercise 4(a). If a=2m and b=2n, then a+b=2(m+n). Since 
m + nis an integer, then a + b is even. 


Exercise 6. Ifa = 4n + 1, then a = 16n? + 8n + 1 = 8(2n? +n) + 1. The 
proof is similar if a = 4n + 3. 


Exercise 1. 3. Exercise 3. If d=(n,n + 1), then d divides 1, the differ- 
ence of the two numbers. Because the only positive divisor of 1 is 1, 
then d = 1. 


Problem 3. Write 1 = ma + nb and 1 = xa + yc. Multiplying, we obtain 
1 =(mxa + nyc + nxb)a + (ny)bc. Because | is a sum of multiples of 
a and bc and is also the smallest positive integer, then 1 = (a, bc). 


Exercise 1. 300. Exercise 2. Apply the theorem using the fact that 
(a, b) = 1. 


Exercise 5. At least one of the factors in the product must be even and 
one must be divisible by 3. Since (2, 3) = 1, then 2-3 must divide the 
product. 


Problem 3. Let D,_, =[a,,-°°,a,-,], D, =[a,,°°:,@,] and D= 
[D,-1,4,]. Since D,_,|D, and a,|D,, then D|D,. Therefore, 
D < D,,. Since Dis acommon multiple of a,, a, ---, a, and D, = [a,, ---, 
a,|, then D, < D. Therefore, D = D,. 
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1.6. 


1.8. 


2.2. 


2.3. 


3.1. 


3.2. 


3.3. 


3.4. 


3.5: 


Exercise 2(a). (2030, 714, 2205) = ((2030, 714), 2205) =(14, 2205) =7. 
7 =1-2205 — 157-14 = 1 - 2205 — 157- (19 - 2030 — 54-714) 
= 1 - 2205 — 2983 - 2030 + 8478 - 714. 


Exercise 3. Write « =[«] +6, 0<0< 1. Divide [«] by m, obtaining 
[oJ =qm+r,0<r<m-—1. Then 


Fie lee ee wo Gauegt 


Since g is an integer and 0 < (r + 8)/m < 1, then 
at [«] 
]-+-[2 
m m 


Chapter 2 


Exercise 2(a). x = 6 + 29k, y = 19k. 
Exercise 3(a). x = 6 + 29k, y = 19k for all k > 0. 


Exercise 1. w = 27 — 10t — 2x + 6y, x = x,y =y, Z =3t —2y. 
Exercise 3. 82 Type A and 13 Type B or 41 Type A and 42 Type B. 


Chapter 3 


Problem 1. If m =0, the concept reduces to that of equality. 


Exercise 3. (a) Not symmetric; (c) yes. 


Exercise 4. If a? = 1 (mod p), then p|(a — 1)(a + 1). Since p is a prime 
number, then p|(a — 1) or p|(a + 1). 


Problem 3. (g — 1)" =(—1)" (mod g) and (g + 1)" = 1 (mod g). 
Exercise 1. (a) x = 47 (mod 52); (6) x = 5, 35, 65, 95 (mod 120). 


Exercise l (b). Rr = R_a7 R _ 80 = Rs. 
Exercise 2 (b). x is in Bs; VU B35, VU Bes VU Bos (mod 120). 


Exercise | (a). #272 (mod 3885). 
Exercise 3. 119 + 420k, where k is a nonnegative integer. 


Exercise 4. 17 + 12k, where k is a nonnegative integer. 


Problem 2. Show that the given number is a solution and apply 
Corollary 6.1. 
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3.6. 


4.1. 


4.2. 


4.3. 


4.4 


Exercise 2 (b). No solution; (c) x = +9 (mod 23). 


Exercise 3. Let f(x) = 2x* — 25x? + 9x + 1. The solutions of f(x) =0 
(mod 5) are x = 2, 4 (mod 5). The only solution of f(x) = 0 (mod 11) 
is x = 3 (mod 11). The solutions of f(x) = 0 (mod 13) are x = 1, 2, 3 
(mod 13). Thus, there are six solutions of f(x) =0 (mod 5- 11 - 13): 
x = 14, 157, 289, 432, 509, 652 (mod 715). 


Problem 4. Suppose that /(k) =p (a prime). If K =k (mod p), then 
J(K) = 0 (mod p). If nis the degree of f(x), there are at most n such 
values of K for which f(K) = p. For any other value of K = k (mod p), 
J(K) must be composite if it is positive. 


Chapter 4 


Problem 2. Ifa, + bjm = a,n + b,m (mod mn), then ajn = a,n (mod m), 
implying that a; = a, (mod m), since (m, n) = 1. Therefore, i = u. Simi- 
larly, 7 =v. Thus, the mn elements in the set are incongruent (mod mn). 
By Theorem 1, this implies that the set is a complete system of residues 
(mod mn). 


Problem 1. It follows from Problem 2, Section 4.1, that the elements 
are incongruent (mod mn). Since (a+ b,m,m)=(ajyn,m) and 
(a;, m) = (n,m) = 1, then (a;n + bm, m) = 1. Similarly, (a,n + bjm,n)=1. 
Therefore, a,;n + b,m is relatively prime to mn. Let N be relatively prime 
to mn. Since m and n are relatively prime, there exist integers a and b 
such that N = an + bm. Since the greatest common divisor of a and m 
divides N, then (a, m) = 1. Therefore, a is congruent (mod m) to one of 
the elements in the reduced system of residues (mod m), say a =a, + k,m. 
Similarly, b = 5; + kjn. Substituting, we obtain 


N = (a, + kym)n + (6; + kjn)m = ajn + bm (mod mn). 


Since each element relatively prime to mn is congruent (mod mn) to one 
of the elements in the set, it follows that the set is a reduced system of 
residues (mod mn). 


Exercise 1. @(873) = o(37) @(97) = 6: 96 = 576. 


Exercise 2. Show that @(p") is even if p is an odd prime and ¢(2") is 
even if m > 2. Use the multiplicative property of q. 


Exercise 4. No solution. 


Exercise 1. 4. 


Problem 1. Use the fact that k!|p(p — 1)---(p —k + 1) and (K!, p) = 1. 
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4.5. 


Exercise 2. (a) No; (5) No. 


4.7. Exercise 3. 6. 


4.8. 


Problem 1. Let a belong to the exponent m (mod p”). Then a” = 1 
(mod p) and so k|m, say m = kn. Since (a* — 1)? = a“? — 1 = 0 (mod p), 
then kn|kp and so n|p. Since p is a prime, this means that n = 1 or 
n =p, that is, m =k or m= kp. 


Exercise 1. m =7: 3 and 5. Exercise 2. None. 


App. Exercise 1(b). 2, 3, 8, 11, 12, 18, 25, 27, 31, 32, 44, 48, 49, 53, 65, 


5.1. 


3.4. 


5.5. 


6.2. 


6.3. 


7.2. 


66, 70, 72, 79, 85, 86, 89, 94, 95. 


Problem 2. Let d = (b, p — 1). The congruence has a solution if, and 
only if, d|(ind c — ind a). If this is the case, there are exactly d incon- 
gruent solutions (mod p). 


Chapter 5 


Exercise l(c). 56.06316344506035032216,. 
Problem 1. Compare the series to the geometic series 


pipe eat 
10 10° 10° 


Exercise I(a). 528. 


Exercise 1(b). 1392, third position. 
Chapter 6 

Exercise I(a). 140, 6. 

Exercise 3. a, and b,, a, and by. 


Chapter 7 


Exercise 2(b). 1, 6, 6, 21, 6, 36, 6, 56, 21, 36. 
Exercise 3. 6. 21. 


Problem 1. Use mathematical induction to prove that (@ 0 t)(p”) = o(p"). 
Use the multiplicative property of @ © T. 
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7.3. Exercise 1. 1, —2, —3, 0, —5, 6, —7, 0, 0, 10. 
Exercise 4. Consider (« o B)(1). 


Problem 3(d). Let a(n) =n and define 6B by a+#f =t. If p and q are 
distinct primes, then B(pqg) # B(p) : B(q). 


Problem 4. Prove the formula for the power of a prime and use (0) of 
Problem 3. 


Chapter 8 


App. B. Exercise I(a). 1535 - 1987. 
Exercise 2. If x —y2>c, then x+y < N/c. Subtracting, we obtain 
y <(N — p*)/2p. Therefore, 


N — p*\2 N 2\ 2 
vaneyens(“P) -( s.| 
2p 2p 


App. C. Exercise 1(a). 23 - 59. 


Chapter 9 


9.1. Exercise 1. Quadratic residues: 1, 2, 4, 8, 9, 13, 15, 16. 
Problem 1. If x? =a and y* = b (mod m), then (xy)? = ab (mod m). 
Problem 4. Use Problem 3 and Theorem 7 of Chapter 3. 

9.2. Problem 2. Let a, be the associate of b,. Since 1 and p — 1 are their 


own associates and this 1s not the case for the numbers between 1 and 
p — 1, then 


(p— 1)!=1- a,b, + a,b, + QAip - 1)/29(p - 1)/2 ‘(p—-)D=0- 1) 
— 1 (mod p) 


9.3. Problem 2. Obviously, the numbers gq, 2q, ---, (gq — 1)g are quadratic non- 
residues. If one or more of these numbers is greater than p, there is a 
smallest such number, say kq. Since p is a prime, then 


(A — l)q<p<kq 


and so kq < p +q. Thus, kg is congruent (mod p) to a quadratic residue 
and is itself a quadratic residue—a contradiction. Therefore, each of 
the numbers is less than p. Since (¢ — 1)q < p, then 


ge ee 
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9.5. 


9.7. 


Problem 5. Suppose that m, m+ 1, ---, m+k-—1 all have the same 
quadratic character. It is easy to see that m > 1. We may thus assume 
that m — 1 has a different quadratic character than does m. Since the 
numbers km, k(m + 1), ---, km + k — 1) have different quadratic charac- 
ter than m, these numbers are incongruent (mod p) to the numbers 
m,m-+1,---,m+k— 1. Thus, there exists a nonnegative integer n such 
thatkm >np+m+k-—landk(m+k—1)<(n+ 1)p +m. Solving the 
two inequalities for np, we obtain 
km+k?—k—-—p—m<np<km—m—k+1 
so that k? <p. 


Exercise l(a). +1. 
Exercise 3. p = +1 (mod 24). 
Problem 1. g = 5, p= +1 (mod 5). 


Exercise l(c). x = 1 and x = 36 (mod 79) are solutions. 


Problem 2. If xo>* =a+ mp then (xo + kp)* =a+ mp + 2xokp + kp? 
=a+(m + 2xok)p (mod p”). If ky is a solution of 2x9k = —m(mod p), 
then (x) + kop)” = a (mod p”). Extend this result by induction. 


Chapter 10 


10.2. Exercise 2. (a) Yes; (b) No. 


Exercise 3. Write d as a product of primes and use the unique factori- 
zation theorem. 


10.3. Exercise 2, Case II. If all of the numbers are even or all are odd, then 


Xo” + Yo” + 207 + Uo” = 0 (mod 4). If one is even and three are odd or 
one is odd and three are even, then x)? + yo* + Zp” + Up” = +1 (mod 4). 
Thus, two are even and two are odd. A similar argument holds for 
Case III. 


Problem 1. If x, y, z are all odd, then x” + y? + z? = 3 (mod 8). If two 
of the numbers are odd and one is even, the sum of the squares is 
congruent to 2 (mod 4). Similar contradictions occur in the other cases. 


Chapter 11 


11.1. Problem 2(b). x,4, =X; +2, 4, = 2%, +9,+2, 2:41 =Vi4,t 1. 


Problem 3. Use Problem 2(a) to show the existence. If x is a prime and 
x=s*—t?, then s—t=1 and s+t=x. Therefore, s =(x + 1)/2 and 
t =(x — 1)/2. 
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Problem 4. x = |2s? — t?|, y = 2st, z = 2s? + t7, where ¢ is odd and is 
relatively prime to s. 


m?* — | 2m 
ote P ] Fs = —_—=_—_- == ‘ 
11.2. Problem 1. x ("5 ri i)a, y (— ri Ja 


11.4. Problem l(c). Use one of the half-angle formulas from trigonometry. 
a= |s*+t*— 6s7t?|, b = 4st|s? — t7|, c =(s? + t”)* where s and ¢ are 
relatively prime positive integers, one even and one odd. 


Chapter 12 


12.3. Exercise 2. (1, 2, 2, 3, 1, 5). 
Exercise 4(a). 31, 22. 
Problem I(a). Suppose that 
alb = (aj, Az, -**, Ay = 1D = (Cy, Ca, °°, Cm = DD 


where m < n. Since a, and c, are integers and 


< | and O< <1 


a,+’. C> 


O< 


then a,;=c,; and a), ---,a,> = €C2, °°, Cm>. If we assume that 
a, =c; G@=1, 2, --,k) (k<m-—1), it follows as above that a; =c; 
(i = 1, 2, ---, m— 1) and 

1 1 


CG: 


Thus, it is impossible to have a,, > 1 orn >m. 


12.5. Exercise 1(a). (5 + ,/85)/10. 
Exercise 2. ./15 = <3, 1, 6>. 


Problem 1. Since a, = C,; <a < C, =a, + l/a, < a, + 1, then [a] = a,. 
Use the limit theorems of elementary calculus to prove that 


l l 
lim (« +u—s} =a, + -—— 
n> 0 Cay, “fy a,» : <a, ree) 
12.8. Exercise 3(b). <6, 1, 5, 1, 12). 
Problem 1. Since 


__ a 1 
meee) MEMS a J/D+m me n+ JD 
D—»m 
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then m+./D = (2m). Therefore, 


= = 1 1 soe. 
= D—- = = —_——- = 
J/D=m+(/D-—m)=m+ Dea m + oes <m, 2m> 


12.9. Exercise 1. x = 2,143,295, y = 221,064. No. 
Exercise 2. —5, 1, 2, 3, 4, 6, 8, 9. 
12.10. Exercise 1 (a). No solution. (6) k = 8, x9 = 307, yo = 64; k = —8, 
Xo =— 77, Yo = 16. 
12.11. Exercise 1. z) = —1, x =2 — 15k — 266k”; y = —14+k + 114k?; 
Zo = —7, X = —7 — 99k — 266k”, y =2 + 37k + 114k?; 
Zy = 12, x = —24+4+ 167k — 266k?, y = 12 — 77k + 114k?; 
Zp = 18, x = —57 + 251k — 266k?, y = 27 — 113k + 114k?. 
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